FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CAT AC ATT AC TGC AG T AACAC TCCACCAT AT AGACCCGGC T T TACCT TATATC AGT GACACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATAT GT TT GT TCAGACCAT C CT T T CCTACCAAAT GCAGCCCAAAAT C CATGGCAAACAA.GTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGT TT T T T CCTGAC T T ACAT T CGTGAT T T T C AGAAAAT T T C T T T ACGGGT GGAAGCCAAT T TACA 
T GGAT TAACCC T C TAT GAC AC TGCACC TT GCCC TAT TAACAAT GAAC GAACACGGCTACTTT CCA 
GAGATAT T TGA T GAAAGGATAAAAT AT TT CTGTAATGAT TAT GAT TC T CAGGGAT TGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAAT GC TGATAAT CAGGAAACATGAAAGAAGCCATT T GATAGAT TAT TC TAAAGGATAT CAT 
CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 3.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAHLNIAAV 
LCIATIYVRYKQVHALSPEENVI IKLNKAGLVLGILSCLGLS IVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, '99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGT TT CC TCAT CGACT GCAGCAT CAT GAT TACCTCC C AGATACT ATTTT TTG 
GAT TTGGGTGGCTTTTCTT CAT G C G C C AAT T G T TT AAAGACTAT GAG AT AC GT CAGT AT GT TGT ACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACT GGAAAAT GAACCT GT GT GTAATT CT GCTGAT CCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
C TCAGGAATGT GACT GACAC GGATATTCTAGCCCTGGAAC GGC GACTGCTGCAAACCAT GGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
C TTATTCAACAGGAAGT GG AT GC T T T G GAAGAAT T AAGC AG G C AG CT TT T T CT GGAAACAGCT GATCT 
AT AT GCTACCAAGGAGAGAATAGAATACT CCAAAAC CTT CAAGGGGAAATATT TTAATT T T CTTGGTT 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGAT CCTGT CACAAGAGGCATT GAGATCACT GTGAATTATCTGGGAATCCAATT T GATGT 
GAAGTTTT GGTCCCAACACAT TT CCTT CATT CTTGT T GGAAT AAT CAT C G T C ACAT C CAT C AG AG GAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCT GACAT T TTATAAACAAACAAAAT GCTATGG TAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AAC AT GAG AG G G AGAAC T AAC T C AAGAC AAT ACT C AGCAG AGAG CAT CCCGTGTG GAT AT G AGG C T G G 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTAT GGTAGCT GAGC CAAACACGTAGGATTT CCGTT TTAAGGT T CACAT GGAAAAGGTTAT AGCTTTG 
CCTT GAGAT TGACTCATTAAAAT CAGAGACT GTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACT C TAGAGT C GACC T G CAGAAG CTTGGCCG C C AT G G C C CAAC T T GT T TAT T G CAGC TT AT AAT G 
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FIGURE A 

MS FL IDS S IMI TSQILFFGFGWLFFMRQLFKDYE IRQYWQVI FSVT FAFSCTMFEL 1 1 FE ILGV 
LNS S SRYFHWKMNLCVI LLI LVFMVP FY IGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALSS I L FLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACT C T CCAT CC GGAC TAGT TAT TGAGCATC TGC C TC TCATATCACCAG TGGCCAT C T GAGG T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATGG CCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
G T CATGCAGAAT T AT GGGGATCACCC T T GT GAGCAAAAAGGC GAACCAGCAGC T GAAT TT CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTATTGT 
u CAGTGACAGTACCTACTCGGTGGGATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
\Z CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
M GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
Q AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
Q) CAGCTGGT CT TGGAT T TT GCTATGTCAAAAGGTATGTGAAGGCCTTCCCTT TTACAAACAAGAAT 

H I CAGCAGAAGGAAAT GAT CGAAAC CAAAGT AGTAAAGGAGGAGAAGGCCAAT GATAGCAACC CTAA 

f\ TGAGGAATCAAAGAAAACT GATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAAC TACCGTGC 

y\ GATGCCT GGAAGCTGAAG T TTAGAT GAGACAGAAATGAGGAGACACACCT GAGGC TGGT TT CT T T 

UJ CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
Q CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
% % T TC TAGC C T GGC TAT G T CCT AATAATATCCCAC T GGGAGAAAGGAGT T TT GCAAAGT GCAAGGAC 

CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
m AAAGCC7\AGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
Q GCTC TGAAAGAGAAACACGT AT CCCAC CT GACAT GT CCT T C TGAGCC C GGT AAGAGCAAAAGAAT 

ffj GGCAGAAAAGT T TAGCCCC T GAAAGC CAT GGAGAT T C TCAT AACT TGAGACCTAATC TCTGT AAA 

% GC TAAAAT AAAGAAAT AGAACAAGGCT GAGGATACGACAGT ACAC TGT CAGCAGGGACTGTAAAC 

2* ACAGACAGGGTCAAAGTGT TT T C TC T GAACACATT GAGT T GGAATCACT GT T T AGAACACACACA 

III C T T AC TTTTTCTGGTCT C T AC C AC T G C T GAT AT T T T C T C TAG G AAAT AT AC T T T T AC AAG T AAC A 

AAAAT AAAAAC T C T T AT AAAT T T C TAT T T T TAT C T GAG T T AC AG AAA T GAT T AC T AAG G AAG AT T 
ACTCAGTAATT TGTT TAAAAAGTAATAAAAT TCAACAAACATT TGCTGAATAGCTAC TAT AT GTC 
AAGTGCT GT GCAAGGTAT TACAC TC TG TAAT TGAATATT AT TC C T CAAAAAAT TGCACATAGTAG 
AACGC TAT C TGGGAAGCTAT T T T T T TCAGT T T T GATATT T CTAGC T TAT C T ACT TCCAAAC T AAT 
T T T TAT T T T TGC T GAGAC TAAT C T T AT T CAT T T T C T C TAAT AT GGC AACC AT TAT AAC C T TAAT T 
TATTATTAACATACCTAAGAAGTACAT TGT TACC TCTATATACCAAAGCACATT TTAAAAGTGCC 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
T GTGACAAAAAA.T TAAAGCAT T TAGAAAAC T T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LS LAGKDQVE TALKAS FE T CS YGWVGDG FWI S R I S PNPKC GKNGVGVL I WKVP VSRQ FAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTEVFMET S TMS TETE PFVENKAAFKNEAAGFGGVPTALLVLALL FFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGC AT CAT C AG AAG T G AAC C T G GC AAAC T T AC C T C C C AG C T AT C AC AAT G AGAC C AAC AC AG AC 
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
I , ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
%\ ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
% GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
y ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
£M ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
til CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
\\ CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
hj GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
£*j C CAGGAGC T GGAGGACC T GGAGAGGAGCC T GAC TGAAGAGATGGCGC T GGGGGAGCC TGCGGCTG 

*** CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
L TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
?jf CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
U'l CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
0 GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
y| TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
fy TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
fjj CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
. CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
C T AC T GT TAGGAACAGCAGT GT T C TCACAG T GTGGGGCAGCC GT C CT T CT AATGAAGACAAT GAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC 
AAAATCAC T TAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
T T T TCAGGTGT CATGGACT GTT GCCACCAT G TAT TCAT CCAGAGT TC T T AAAG TT TAAAGT T GCA 
CAT GAT T G TAT AAG CAT GCTTTCTTT GAG T T T T AAAT T AT G TAT AAAC AT AAG T T G CAT T T AG AA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RS AVEEME AEEAAAKAS SEWLANL PPS YHNE TNTDTKVGNNT IHVHRE I HKI TNNQTGQMVFS E 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RS LTEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

a 

z: amino acids 1-19 

jr| N-glycosylation site. 

%l amino acids 96-100, 106-110, 121-125, 204-208 

m ■ ■ .. . 

Casein kinase II phosphorylation site. 

PI ■ amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

Iff 327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

m 

Amidation site. 

amino acids 140-144 
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FIGURE o 



CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCT7UVTCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATAT TGACCACAAGCT CAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCT GT TT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT T CTGGGCAAGAAGAACGAGGC GCCC CCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT T T T GT TAAGT C 
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FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIT^RVIL 
E Y I DHKLRGVQNP VARC IMCC FKCCL WCLEKF I KFLNRNAY IM I AI YGKNFCVS AKNAFMLLMRN 
I VRVWLDKVTDLLLFFGKLLWGGVGVLS FFFFSGRI PGLGKDFKS PHLNYYWLP IMTS I LGAY 
VI ASGFFS VFGMCVDTL FLC FLE DLERNNGS LDRP Y YMSKS LLK I LGKKNE AP P DNKKRKK 

Important features : 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 27 6-28 8 

Ammonium transporters proteins. 

amino acids 204-231 

N-myr i s toy 1 a ti on s i te s . 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGGCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

0 CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

m 

*[[ TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

1 B | CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
P TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
« GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

til GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

Ul 

^ ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
fjf ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 



5 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 

ECDSRAWYAGLFFFTLLFYLLS IAAVAMFMYYTEPSGCHEGKVFI SLNLTFCVCVS IAAVLPKV 

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLV^SLHVMMTLTNWYKPGETRKMI^ 

FS 

Signal sequence: 

amino acids 1-20 



kU Transmembrane domains: 

a 



amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



n 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCC T GTAACCC T CCT CCAGGATGAACCAC C TGCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAAT GGAGGCATTGAGAACACAT TAGAGAAGGAGGTGAT GCAGTAT GAC TACTATT CT TCAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT T T TAGAAC TATGAG TACT ACT T T TG T TAAATG T GAAAAAC CC TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CAC TGCT GGCT T TAT T GAACAGC TAAT AAAGAT T TAT T TAT TG TAATACCT CACAAACGTT GTAC 



f H CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
Q T CAGT GAGACTGAGC CT GATGTGT TAACAAATAGGT GAAGAAAG TCT T GT GCTGT AT TCCT AAT C 

31 AAAAGAC T TAATATATT GAAGTAACACT TT T T TAGTAAGCAAGATACC T T T T TAT T T CAAT TCAC 

agaatggaatttttttgtttcatgtctcagatttattttgtatttcttttttaacactctacatt 

III 

tcccttgttttttaactcatgcacatgtgctctttgtacagttttaaaaagtgtaataaaatctg 
acatgtcaatgtggctagttttatttttcttgttttgcattatgtgtatggcctgaagtgttgga 
c t t gcaaaaggggaagaaaggaat tgcgaatacatgtaaaatgtcac cagacat t tgt at tat t t 
ttatcatgaaatcatgtttttctctgattgttctgaaatgttctaaatactcttattttgaatgc 
acaaaatgacttaaaccattcatatcat gt ttcct t tgcgttcagccaat ttcaat taaaat gaa 
ctaaat taaaaa 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI I ELNVNGG IENTLEKEVMQYDY YS S Y FD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 0 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCGAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
u CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
% % GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
y TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
p CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
01 TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
H| TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
™\ TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
] V \ CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
ill CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
Q TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
1.^ ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
Ul GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
Q AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
m GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
SI GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
y GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
01 CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGGAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGTGT GACT C TAGTC T TGGCCCCAGGAAGCGAAGGAACAAAAGAAAC TGGAAAGGAAGAT GC 
T T TAGGAACAT GTTTTGCTTTTT TAAAAT AT AT AT AT T T AT AAGAGAT CC T T TC CCAT T TAT T C T 
GGGAAGAT GT T T T TCAAACT CAGAGACAAGGAC TTTGGTTT T TGTAAGACAAACGATGATAT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGS FAGLPGLQLLDLSQNQI ASLPSGVFQPLANLSNLDLTANRLHE I TNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTAOTE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LIJ^AAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids i-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCT TAGAT T CCAAGACT AC T T T GACATCAGATGAGT CAGTA 
AAGGACCATAC TAC TGCAGGCAGAGTAGT TGCTGGTCAAATAT TTCT TGAT T CAGAAGAATC TGA 
AT TAGAATCCT C TAT TCAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGT GT CAC AG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGAC GGCAGAT GCAGGAAGCAGAAAT GAT G TAT CAAAC T GGAAT GAAAATCC T TAAT GGAAG 
CAAT AAGAAAAGCCAAAAAAGAGAAGCATATC GGTAT CT CCAAAAGGCAGCAAGCATGAAC CATA 
CCAAAGC CC TGGAGAGAGT GT CATATGCT CT T TTAT T TGGT GAT TAC T T GC CACAGAATAT CCAG 
GCAGCGAGAGAGAT GT T T GAGAAGC TGAC TGAGGAAGGCTCTCCCAAGGGACAGACTGC TC T T GG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AATAAT AT T AACAT CAGAAGAAT T T GT GGT TTATAGCGGC CACAAC T T TT T CAGC TT TCAT GATC 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACAT GGAAT C TACAT G TAAATGAAAGT TGGT GGAGTC CACAAT T T TT C T T TAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T GT CAGAATCAT T T T TTACAT TAGAT TATCATAAT T T TAAAAAT T T T T CT T TAGT T T T TCA 
AAAT T T T GT AAATGGT GGC T ATAGAAAAACAACATGAAATATTAT ACAATAT T T TGCAAC AATGC 
C C T AAGAAT T G T TAAAAT T C AT GGAG T TAT T T G T GC AGAAT GAC T C C AGAG AGC T C TAC T T T C T G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACTCATT TT TAATAAAAT TATGTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T TATCACTGGATGTGACTCGGGC T T TGGAAAC T TGGCAGCCAGAACT TT T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AAC C T C AG AG AG AC T T C G T AC TGTGCTTCTG GAT G T G AC C GAC C CAGAGAAT G T C AAG AGG AC T G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCATT GAACCAGGATT GT T CAAAACAAACT TGGCAGATCCAGTAAAGGTAAT T GAAAAAAAAC 
TCGCCATT TGGGAGCAGCTGTC TCCAG AC AT C AAAC AAC AAT AT GG AG AAGGTTACAT T GA&AAA 
AGTC TAGACAAAC T GAAAGGCAATAAAT CC TAT G TGAACAT GGACCTC TCT CCGGT GGTAGAGTG 
CATGGACCACGC TCTAACAAGT C TCT TCCC TAAGAC TCAT TATGCCGCT GGAAAAGAT GCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTG TGA CTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CATCTC T TAT C TAAAT AT TAAAAGAT AAGTCAACCCAAAAAAAAAAAAAAZVAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKMLANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TACTGAT TCCCAAATGGATGATGT TGAAGT TGTT TATACAAT TGA 
CAT T C AGAAAT AT AT T CCAT GC TAT C AGC T T T T T AGC T T T T ATAAT T C T T C AGG C GAAG T AAAT G 
AGCAAGCACTGAAGAAAAT AT TATCAAAT GT CAAAAAGAAT GTGG TAGGT T GGTACAAAT TCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
T TCAAAC CAAGAC CTTGTTTTTCTGC TAT T AAC ACCAAGT AT AAT AAC AG AAAGC TGC T C T AC T C 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TT TTAGCCGAGCAGTACAAACACACAGCT CTAAATT TT TT GAAGAAGATGGATCCT TAAAGGAGG 
TACATAAGAT AAAT GAAAT GTATGCT T CAT T ACAAGAGGAAT TAAAGAGTAT ATGCAAAAAAGTG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT GT C T T T AAAAAAT AGACAT G T T T C T AAAAG T AGC T G T AAC T AC AAC CAC CAT C T C GAT G TAG T 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGT^AGCTAGTCCAGCTAGTACACCAC 
AAAT CAT T AAGC AT AAAG C C T TAG AC T TAG AT GACAG AT G G C AAT T C AAG AG AT CTCGGTTGT T A 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTT TTGGTGAATAT TCACGGTCTC 
C TACAT T T TGA. T CCT T T TAACC T TACAAGGAGAT T T T T TT AT T T GGC T GAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T GT T TGCAGT AATACACAGATAAC TC T TAGT GCAT T TAC T T CACAAAGT AC T T T T T CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATT CT TTAGAAT T GGAAAAG TGAGACCAGGC ACAGT GGC TCACACCT G TAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGAT T GAAGC TAGGAGT TAGAGACCAGCCTGGGCAACGTAT T 
GAGACCATG TC TAT TAAAAAATAAAATGGAAAAGCAAGAAT AGCC T TAT T T T CAAAATAT GGAAA 
GAAAT T TAT AT GAAAATT TATC TGAGT CAT TAAAAT T C TC C T T AAGTGAT AC TT T T T TAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEWEQALKKILSNWKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKNI QKDPQENI FLCQALRT FFPNSE FLHSCVMS LKNRHVSKS S CNYNHHLDWDNLTL 
MVEHTDI PEAS PAS TPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

; ; 

{ r . 

*Z*J. Important features: 

l» ^ 

Tp Signal peptide: 

| * | amino acids 1-19 



feljF N-glycosylation sites. 

amino acids 75-79, 322-326 

if! N-myristoylation site. 

Q . amino acids 184-154 

CO 

J;;; Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACC ATG GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGTACAG C CT GCAAAGTAAAGAT GATTT CAAAAGT GT GGT CAGC GAACAGT G CAATCAT TTG 
CAAG CT GT CTTTGCTT CAG GT TAC AAGAAGTTT GAT GAAT T CTT CAAAGAACTAC T TGAAAAT GCAGAGAAAT C CCTG 
AAT GATAT GTTT GT GAAGACAT AT GGCCAT T TATAC AT GCAAAAT T CT GAGCTAT TTAAAGAT CT CTT C GTAGAGT TG 
AAACGTTACTAC GT GGT GGGAAAT GT GAAC CTGGAAGAAAT GCT AAAT GAC TTCTGGGCTCGCCTCCT GGAGCGGATG 
TTCCGCCTGGT GAACT CC CAGTAC CACT TTAC AGAT GAGT ATCT GGAAT GT GT GAGCAAGT AT AC GGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGC GGT T GCGG GAGAT GT C GT GAGCAAGGTCT CC GT GGTAAAC C CC ACAGC CCAGT GT AC CCAT GCC CT GT T GAAG 
ATGATCTACT GCTCCCACTGCCGGGGTCTCGTGACTGT GAAGCCAT GTTAC AACTACT GCT CAAACAT CAT GAGAGGC 
T GTTT GGC CAACCAAG GGGAT CT C GATT TT GAATGGAACAATTT CATAGAT GCTATGCT GATGGTGGCAGAGAGGCTA 
GAGGGT CCT TT CAAC ATT GAATCG GT CAT GGAT CCCAT CGATGT GAAGATTT CT GATGCTATTAT GAACATGCAGGAT 
AAT AGT GTT CAAGT GT CT CAGAAGGT TT T CCAGGGATGT GGAC C CC CCAAGCCCCT CC CAGCT GGACGAATTT CT CGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGTTTGGACC GACT GGTTAC T GAT GT CAAGGAGAAACT GAAACAGGCCAAGAAATT CT G GT CCTCCCT TCCGAGCAAC 
GTTT GCAAC GATGAGAGGAT GGCT GCAGGAAACGGCAAT GAGGAT GAC T GTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CTGT TT GCAGT GACAGGAAAT GGATTAGC CAACCAGGGCAACAACC CAGAG GT CCAGGTTGACAC CAGC AAACCAGAC 
ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTAC CT CCT CACT GT CTT CT GCAT CT T GTTCCT GGTT AT GCAGAGAGAGT GGAGATAATT CT CAAACT CT GAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAAT ATTAAAT AGCT GTACAGAAGCAGGT T TTAT TT ATCATGT T ATCT TATTAAAAGAAAAAGCC CAAAAAGC 
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FIGURE 24 

MZVRFGLPALLCTLAVLSAALIAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKP FGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVS WNP TAQCTHALLKM I YCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNI E SVMDP IDVKI S DAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFC ILFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGAGTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCAC T C T T CCAAT T AAACAT TCT CAGCCAAGAAGACAG T GAGC AC AC C TACCAG AC AC T C 
TTCTT.CTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAA7VAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGC7\AACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPmmSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGT T C CT GACAAAGAGCAT GGC T GAGAACAGGCACC T CCAATAT GAGCG.GT T TGT GGT GGC T CCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCGAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T TCCCCAGCCTC CAAT TAGAACAAGCCACCCACCAGC CTAT CTATCT T CCAC TGAGAGGGACC TA 
GC AG AAT G AG AG AAGAC AT T CAT G T AC CAC C T AC TAG TCCCTCTCTCCC C AAC C T C T G C C AG GGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GC C AGAG CAAGAC T C AAAGAGGC AGAG GTTTTGTTCT C AAAT AT T T T T T AAT AA&T AG AC GAA 
ACCACG 
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FIGURE 28 

MDILVPLLQLLVLLLTLPLHimLLGCWQPLCKSYFPYmAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSE IQMERQPPPLKWLPVGPHIMGKAVKQS FPS SKAL I CS FPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myr i s toyl a t ion s i tes . 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC T GGGAGATAGGGAACAGAAGAGGGTAG TGGGT GGGC TAGGGGGGC TGCCT TAT T TAAA 
GTGGTTGTT TAT GAT T CT TAT AC TAAT T T AT ACAAAGATAT TAAGGC CCT GT TCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAG C TGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 3Q 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

l si Growth factor and cytokines receptors family: 

til amino acids 3-18 

g 

m 
yj 
H 
y) 
0 

a 
m 
a 
m 
a 
fli 
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FIGURE 21 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTGACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTT TATGG T GGC TG TAT TAT GAC TAT ACC AACGACC T CAGCAT AGAAT T GGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATG CATAT AC TAC AAC TGC TAT T AAT GGGAC AGAT T T C TG T ACATCAGCAAAAGAT GC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT T AAAC AAT GCAAGGGCACAGCAGGACAAGCAC T CAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCAT T TAGGTATCT GTACC TGGAAAACAT T 
T CCT T CT AAGAGC CAT T T ACAGAATAGAAGATGAGACCACT AGAGAAAAGT T AGTGAAT T T T T T T 
T TAAAAGACC TAATAAACCC TAT T C T TCCT CAAAA 
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FIGURE 32 

MSGRDTILGLCIIJ^LSLAMMFTFRFITTLLVHIFISLVILGLLFVGGVLWWLYYDYTNDLSIE 
LDTERENMKCVLG FAI VS TGI TAVLL VLI FVLRKRI KLTVEL FQ I TNKAI S SAP FLL FQPLWT FA 
I L I F FWVLWVAVLLS LGT AGAAQVME GGQVE YKPL S G IRYMWS YHL IGLIWTSEFI LACQQMT I A 
GAWTCYFNRSKNDPPDHP I LS SLS I LFFYHQGTWKGS FLI S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKL3STNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myr i s toyl a ti on s i tes . 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



yj 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT T GT AT GGGATT T GT GATGCAGGAAAGCC TAAGGGAAAAAGAATAT T CATTC T GT GT GGT 
GAAAATTTTTTGAAAAAAfiAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAG T TAT TCCAACGGTGT CCAATCG T TAT CCC TACCACGATGGAGAGAAT CCT T 
TAT CGT CT TAGAAAGT AAACCCAAAAAGGGT GTAACC TACCCAT C AGC T C T TACATACTC AT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
H CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
CJ GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAG7^ACCTGTTGGAGC 
GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
TGGGAGAT C C AAAC T GC AAA&T T GAC TTGTCGTTTT T AAT T GAT GGGAGC ACCAGCAT T GGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
S\ CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
U| ACACGAAT TCTCGAGAT CTGAAGACAGCCATAGAGAAAAT TACT CAGAGAGGAGGACT T T C TAAT 

P>5 GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GAC T T G C GAG AG AG T C AG G AAT C AAC AT TTTCTTCATCACCATT G AAG GTGCTGCT G AAAAT GAG 
C| AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT T TGACAACC T CCAT CAGTATGTCCCCAGGAT CATC CAGAACAT T T GTACAGAG T T CAAC T CAC 
AGCC T CGGAAC TGAA T T CAGAGCAGGCAGAGCACCAGCAAGT GC TGC T T TACTAACTGACGT GT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
T GAT CAC AAAC G TAT AG AAT GAG C C AAAAG GC T AC AT CAT G T T G AG GG T G C T GG AGAT T T T AC AT 
TT TGACAAT T GT TT T CAAAATAAATGTT CGGAATACAGT GCAGC CC T TACGACAGGC TTACG T AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T G T CAT GAC AAT G T AGGAAT T GC T GAAT T AAAT G T T T AGAAGGAT G AAAAAT AAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAMAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVS LGL VPKEEL S T QS LE PVS LGDPNCKI DLS FLIDGSTSI GKRRFR I QKQLLADVAQALD I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPmAAAMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LI TDGRSYDDVRI PAMAAHLKGVI TYAIGVAWAAQEELEVIATHPARDHS FF 
VDE FDNLHQ Y VPR I IQNICTEFNSQPRN 

Important features : 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 
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FIGURE 35 



CCGAGC AC AGGAGAT T GCCT GCGT TT AGGAGGT GGCTGCGTT GT GGGAAAAGCT AT CAAGGAAG AAAT TGC 
C AAACC AT GTCTTTTTTTCTGTTT TC AG AG TAGTT C AC AAC AGAT CT G AGT GT T TT AAT T AAGC AT GG AAT 
AC AG AAAAC AAC AAAAAACT T AAG CT T T AATT T CAT CT GGAATT CC AC AGTT T T CT T AGCTCCC TGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GT CACTGAGATCCC TCAAAT GGAGCC TCCT GC TGCTGT CACTCCTG AGT T TC TT TGT GAT GT GGT AC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
C AGAC AAG ACTT TC AC TT C ACACT TC GAG AGC AT T C AAACT GCT CTC ATC AAAATC C AT T T C TGGTC AT T C 
* > TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
s**1 TGGT GGGGAT AT GAGGTT CT TAC ATT T T T C TT AT T AGGCC AAGAGGCT GAAAAGGAAG AC AAAATGTTGGC 

Q ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
lO ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
AAGAC AGACACT GAT GTT TT C AT C AAT ACT GGCAATTT AGTG AAGT ATCT TTT AAACCT AAACC ACTC AG A 
, '*\ GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
%l CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 

TTGGTGCC AAGG AT CT AT GAAAT GAT GGGT CACGT AAAAC CC AT C AAGTT TG AAGATGT T TAT GT C GGGAT 

5 

P* g CTGT TT GAAT T TAT TAAAAGT GAAC AT T CAT AT TCC AGAAGACAC AAATCTTT T CT T TCT AT AT AG AATCC 

; n AT T T GGAT GTCT GT C AAC TG AG AC GT GT GATT GC AGCCCAT GGCT TTT CT T CC AAGGAGATC AT C ACT T T T 

htfi 

f'i TGGC AGGTCATGCT AAGGAAC ACC AC AT GC CATT AT TAA CTT C AC ATT CT AC AAAAAGCC T AGAAGGACAG 

■S9'S 

Qj GAT ACC TTGT GGAAAGTGTT AAAT AAAGTAGGT ACT GTGGAAAAT TCATGGGGAGG TCAGTGT GCT GGCT T 
CJ AC AC TGAACT GAAAC T CATG AAAAACCC AG ACTGGAGACTGGAGGGTT AC ACT T GTGATT T AT T AGTC AGG 

Ml CCCT TC AAAG AT GAT ATGTGGAGGAAT T AAAT AT AAAG GAAT TGGAG G TT T TT GCT AAAG AAAT T AAT AGG 

ACCAAAC AAT TT GGAC AT GT C ATT CT GT AG ACT AGAAT TTCT T AAAAG GGT GT T AC TGAGTT AT AAGCTC A 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
T G AATC ATTC TT T AC ATGC AAACATT T TCC AGTT AC TT AACT GAT C AG TTT AT TAT TGAT AC ATC ACT C C A 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT ATAGAG AAGAAT T AAAGC AAGAAAATC TGAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVM^ 

TLREHSNCSHQNP FLVI LVTSHPSDVKARQAIRVTWGEKKSWWGYEVLT FFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP IKFEDVYVGI CLNLLKVNIH I PEDTNL FFLYRIHLDVCQLRRVI AAHGFS SKE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



C GC TCGGGC AC C AGC C GCGGC AAGGATGGAGC T GGGT TGC TGGACGCAGTT GGGGC TCACT TT TC T TC AGC T C CT TC TC ATC 
TCGTCCTT GCC AAGAGAGT AC AC AG TC AT TAAT GAAG C C T G CC C TGGAGCAGAGT GGAATAT CAT G T G T CGGGAGT GC TG T G 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGT TC TGC GAGC C CCAAAGGGTC AGAT T T TGT T GGAAAGC T ATCCCC TAAAT GC TC ACTGTGAATGGAC CAT T CATGC 
TAAACCTGGGT T TGT CATC CAAC T AAGAT TTGTCAT GT TGAGTC TGGAGTT T GACT ACATGTGCC AGTATGAC TATGTTGAG 
GT TC GTGATGGAGACAACC GC GATGGCC AGAT CAT CAAGC GTGTCTGTGGCAACGAGCGGC CAGC TC C TATCCAGAGCAT AG 
GATCC TC AC TC C ACGT CCT CT TCC ACTCCGAT GGC T C CAAGAAT T TTGACGGT T TC CATGC CAT T TAT GAGGAGATC AC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
fth TATAC TGGGCAGCGC TGTGAAAATC TCC T TGAAGAAAGAAAC TGC TC AGAC C C TGG G GGCC C AGTCAATGGGTAC CAGAAAA 

Q TAACAGGGGGCCCTGGGC T TATCAACGGACGC C AT GC TAAAAT TGGC ACCGT GGTGT CT TTCT T TTGTAACAACTCCTATGT 

Q TCTTAGTGGCAATGAGAAAAGAAC T TGCC AGC AGAAT GGAGAG TGGTC AGGGAAACAGCC C ATC TGC ATAAAAGC CT GCC GA 

r|| GAAC C AAAGAT T TCAGACC TGGTGAGAAGGAGAGT T C TT C CGATGCAGGTTC AGTCAAGGGAGAC AC CATTAC AC CAGC TAT 

h| ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
\i ATACC AACATC T GCAT ACCC AGC TC CAG TATGAGTGCAT C TCACCCT TC TACCGCC GCC TGGGC AGC AGCAGGAGGACATGT 

= . j C T GAGGAC TGGGAAGTGGAGTGGGCGGGCACCATCCT GC ATCCC TATC TGCGGGAAAAT TGAGAACATCAC T GCT CCAAAGA 

?n CCC AAGGGT TGCGCT GGCCGT GGCAGGCAGC CATC TACAGGAGGACC AGCGGGGTGCATGACGGC AGCC TAC ACAAGGGAGC 

GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 

a ... 

GTCACCAT GATCAAGAC AGCAGAC C TGAAAGTTGT TT TGGGGAAATTC T ACC GGGATGATGACC GGGATGAGAAGAC CATC C 
AGAGCCTACAGAT TTC TGC TATCAT T CTGCATCCC AACTATGACCCC ATCC T GCTT GATGC TGACATCGCC AT CC TGAAGC T 

is; 

'%! CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
'"Z*t TCCCACAT CACT GTGGC TGGC TGGAATGTCC TGGCAGAC GTGAGGAGCCCT GGCTT CAAGAACGACAC ACT GCGC TC TGGGG 

TGGTC AGT GT GGTGGAC TCGC TGC T GTGTGAGGAGC AGC ATGAGGACC ATGGC ATC C CAG TGAGTGTCACTGATAAC AT GT T 

r! 

CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
C GAGC AT C T C C T GAGC CACGC TGGCATC TGATGGGAC TGGTCAGC TGGAGC TATGATAAAACATGCAGCCAC AGGCTCT C CA 
CTGCC TTC ACCAAGGTGCTGCC T T T TAAAGAC TGGAT TGAAAGAAAT ATGAAATGAACCAT GC TCATGCAC TC CTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGC TT C TGAC TTC AGGGACAAAAC T C AGT GAAGGGTGAGT AGAC C T CCAT TGC TGGTAGGC TGATGCC GC GTCCA 
C TAC TAGGACAGCCAAT TGGAAGAT GCCAGGGC TTGC AAGAAGTAAGTT TC T T CAAAGAAGACC ATAT ACAAAAC CT C TCC A 
CTCCACT GACCTGGTGGTC T T C C C CAAC T TTCAGT TATAC GAATGCCATCAGC TTGACCAGGGAAGATC TGGGC T TC ATGAG 
GCCCC TT T T GAGGCTC TCAAGT T C TAGAGAGC TGCC T GT GGGACAGCCC AGGGCAGCAGAGCTGGGATGTGGT GCATGC C T T 
TGTGT ACAT GGCCAC AGTAC AGTC TGGTCCT T T TC C T TC C CCATC TC TTGTAC ACAT T T TAATAAAATAAGGGTTGGCT T C T 
GAAC TACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE ft8 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
I PCCRNEENE CDS CL I HPGC.T I FENCKS CRNGS WGGTLDD FYVKGFYCAE CRAGW YGGDCMRCGQ 
VLRA.PKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPWGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQP IC IKACREPKI SDLVRRRVLPMQVQSRET PLHQLYSAAF.SKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALWERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QS LQ I SAI I LHPNYDP I LLDAD I AI LKLLDKARI S TRVQP I CLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTT GGGCCCTT GTAGCT GACAGAAGGTGGCCAGGGAGAATGCAGCACACT GCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTT G CCT T GGCT CAGT CCT GCT AACT ACATT GACAAT GT GGGCAAC CT GCAC TT CCT GTAT T CAGAACT CT GTA 
. AAGGT GCCTCCCACTAC GGCCT GACCAAAGATAGGAAGAGGCGCT C AC AAGAT GGCT GTC CAGAC GG CT GT GCGAGC C 
T CACAGCCAC G GCTC CCT CCCCAGAG GTTT CTGCAGCT GCCAC CAT C T CCTTAAT GACAGAC GAGCCT GGCCTAGACA 
ACC CT GCC TAC GT GT CCT CGGCAGAG GACGGGCAGCCAGCAAT CAG CCCAGT GGACT CTGGC CGGAGCAACC GAACTA 
GGG CACG GCCC TT T GAGAGAT C CACTAT TAGAAGCAGATCATT TAAAAAAAT AAAT CGAGCTT TGAGT GTT CT T CGAA 
GGACAAAGAGCGGGAGT GCAGT T GC CAACCATGCCGACCAG GGCAGGGAAAATTCT GAAAAC ACCACT GCC C CT GAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
GCCT CT CTATTAG GCT G GT GG GAGGTAG C GAAACCC CACT GGT CCATATC AT TAT CCAACAC ATTT AT C GT GAT GGGG 
TGAT C G CCAGAGACGGC CGGCTACT GCCAGGAGACAT CATT CTAAAGGTCAAC GGGAT GGACATCAGCAATGT CC CT C 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAAC AAT G GACAGGCC CC GGAT GCCTACAGACCCC GAGATGACAGCTTT CAT GT GAT T CTCAACAAAAGTAGC C 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGAT GAGCCT GGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGT GC GG CT CAT CT GATT CAGG CCAGT GAAAGACGT GTT CAC CT C GT CGT GT CCCGC CAGGTT C GGCAG C 
GGAGCCCT GACAT CTT TCAGGAAGCC GGCT GGAACAGCAAT GGCAGCT GGT C C CCAGG GC CAGGGGAGAGGAGCAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCC7WWVGACCCCGGTGAATCTCTCG 
GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 
TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
GGAGT GAGGCAGTGGCAT TAT T GAAAAGAACAT CAT CCT CGAT AGTACT CAAAGCT TT GGAAGT CAAAGAGT AT GAGC 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 
\ W T GGGCTTCT GCATT GTAGGAGGT TAT GAAGAATACAAT GGAAACAAAC CT T TTT T CAT CAAAT C CATT GTT GAAGGAA 

CAC CAGCATACAAT GATGGAAGAAT TAGATGTGGT GAT AT TCTTCTTGCTGT CAAT GGT AGAAGTACAT CAGGAATGA 
T ACAT GCTT GCTT GGCAAGAC TGCT GAAAGAACTTAAAGGAAGAATT ACT CTAACT AT T GT TT CT T GGCCT GG CACTT 
TTT T ATAGA AT CAAT GAT GG GT CAGAG GAAAACAGAAAAAT C ACAAATAGGCTAAGAAGTT GAAACAC TAT ATT TAT C 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACAC CT CAGAAAAT AT GAT T CC AAAAAAATTAAAACT ACT AGTTTT TTT T CAGT GT GGAGGATT T CTCATT ACT CT AC 
AACAT T GTTT AT AT TT TTT CTAT T CAATAAAAAGC C CTAAAACAACT AAAAT GATT GATT T GTATACCCCACT GAATT 
CAAGCT GATT T AAATTTAAAATT T GGTATAT GCT GAAGT CT GC CAAGG GTACATTATGGC CATT TT TAATT TAC AGCT 
AAAATATTT T TTAAAAT GCAT TG CT GAGAAACGTT GCT TT CAT CAAACAAGAATAAATAT T TTT CAGAAGTTAAA 



Li 
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FIGURE AO 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVS AAAT I S LMTDE PGLDNPAYVS S AEDGQPAI S PVDS GRSNRTRARP FERS T IRSRS FKK I NR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGS E TPLVH 1 1 1 QH I YRDGVI ARDGRLLPGD 1 1 LKVNGMD I SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
. t IKTGDILLOTDGVELTEVSRSEAVALLKRTSSSIVLK^ 

j*j SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
r| RIRCGDI LLAVNGRS TSGMIHACLARLLKELKGRI TLT I VSWPGT FL 

it 

Important features: 
i .] Signal peptide: 

?n amino acids 1-15 

H4 N-glycosylation sites. 

m amino acids 108-112, 157-161, 289-293, 384-388 

m 

f-\ Tyrosine kinase phosphorylation sites. 

f|| amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC CCTCAT TTTGGGGAAGGGAAAAACT C TGACTAATGAAGCAT CCACGAAGAAGGT AGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
T TT G GAAGAGGT ACAGGCAGAAAATCC CAAAGT GTC CAGAGGCCGGTAT CGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
M CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
Wf TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
*p ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 

CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
%\ . GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 

yj GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
Q CCTGAAGTGGGTAAATATACAATGGTCTTCCAGACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
15 GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 

TAGTATC T GTGGAACACAAT CCT T T ATATATCAAC ATCACAGTGGAT TT CTGGTTTGGT GCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
f| ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
r! TAGCAGAGCTCCTGGT GAT GTAGAGTATAAAACAGTT GTAACAAGACAGCTTT CT TAGT CATTTT GAT 

f|J CATGAGGGTTAAATAT TGT AATAT GGAT ACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 

AATGAACGCTAT T TGAGGACT CT GGTT GAAGGAGATT TAT TTAAATTTGAAGTAATATAT TAT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGT C TGAGAGAACCAGAGTT GTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TT TAAAGAGT T TTTGTAAAATGATTT TGTACAAGTAGGATAT GAATTAGCAGTT TACAAGTTTACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKL I FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHP FLQRQQLDYG I YVI HQAE GKKFNRAKLLNVG YLE ALKEENWDC FI FHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE A3 



GCT CAAGACCCAGCAG T GGGACAGCCAGACAGACGGCACGATGGCAC T GAGC TCCCAGATC TGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAAC T T GCAGAGCT GCAACCCCAGGACAGAGCT G GAGC CAGGGCCAGC TGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
T CCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Ad 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

H amino acids 58-59 

a 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 



□ 



ill 
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FIGURE 4F» 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGATC TAC T AT GT GGGGAT ATAC AGCT CAT CAC T C CAGCAGCCCT CCAC CCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAAT GGCACCTGTGTGACCAATGT GACATGC TGCATGGAACATGGGGAAGAGGATGTGAT T 
M TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
41 C TGGAGAT GGGGAGAAAGT GATAT GACCT TCAT C T GC GT T GCCAGGAACCC TGT CAGCAGAAAC T 

: zl TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
7a GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
%l GTT TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAGTGGACAT TTGTCGGG 

III AAAC T CC TAACATAT GCCCCCAT TC TGGAGAGAACACAGAGT ACGACACAATCCC TCAC AC T AAT 

m 

w AGAACAATCC TAAAGGAAGAT CCAGCAAAT ACGG T T T AC TCCAC TGT GGAAAT ACCGAAAAAGAT ■ 

J,. GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 

7h TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

\'i 

S 
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FIGURE 46 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 



m 



a 



Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACC ATGA CCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGGCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGT AAT G C CAAT T G T GAAT T T T CAT T GAAAAAC AT C AG T GACAT T CAT CC AGAAT C C T T C AA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAA^CCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAGT T T AATGGGAATAAAATGTAAGTAT CAG TAGT T TGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMSLTARKRACCNNRTGMFLS S FFSVI TVIGAL YCML I S I QALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE aq 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACC TGCAGAGTCACACAG T TGC T GACCAATTGAGCTGT GAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATGCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
M TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 

TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 

it's 

ill 

ill 
a 



m 

■ u 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHS PVPEKAI PLI T PGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 , 

amino acids 54-67 
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FIGURE si 



G TGGACT C T GAGAAGCCCAGGCAGT T GAGGACAGGAGAGAGAAGGC T GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCT GGTGGGAGGAAGACAC TC T GGAGAGAGAGGGGGC TGGGCAGAGATGAAGT T CCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGAT GCT T T GGGCAACAGGGT CGGGGAAGCAGCCCAT GCT C TGGGAAACAC TGGGCACGAGA 
^ TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
% GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
2fi CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
y| GAAACT CAGCAGGCAGC T T TGGAAT GAAT C CT CAGGGAGC TC CC TGGGG T C AAGGAGGCAAT GGA 

"%) GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ft 

AC T C T G G G G GAG G C AG C G G C T C AC AG T C G G G C AGC AG T G G C AG T G GC AG C AAT G G T G AC AAC AAC 
ffi AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
jjf TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
C! CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
y * CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

0 

«H C T T C AGAG G AC AGG GAG T T T C CAG CAACAT GAGGGAAAT AAG C AAAGAG G GCAAT CGCCTCCTTG 

i II 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?\AAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETS'PGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



Signal peptide: 

amino acids 1-21 

N-glycosylation site. 



'H amino acids 2 65-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 
Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67/ 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAAT GCATC TTCAGCT TTGACAGCCAT TGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
C T TGGAGCTCAGTGCCCTT GTAGAGAAAAGAAGCCAGCAT AT CCTCCAGCACATGGAC T T T CT GT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGT CCAAGAC T T T GGAT T TCATT GAT GTGC T T CT GC TGAGCAAGGAT GAAGAT GGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCT TCTGAAGGACCGCGATCC TAAAGAGATTGAAT GGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE zj. 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAF I PFS AGPRNC I GQAFAMAEMKWLALMLLHFRFLPDHTE PRRKLEL IMRAEGGLWLRVE PLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA TTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGAT GTT GTCCCACT GAAT TCCCAT GAAT ACAAACCT AT T CAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 



MGPWQLKRKFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIPFAR 
DAVKKCFAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
G GAGT CAAGATATCTGAACAG CAG GCAGAAAAAATT CT CAAGAGCAT GGAT AAAAACGGCAC GAT GAC CAT C GACT G G 
AAC GAGT GGAGAGACTAC CACC T C CT C CACC C CGTG GAAAACAT CC CCGAGAT CAT C CT CT ACT GGAAGCATT C CAC G 
AT CT TT GAT GT GGGTGAGAAT CTAACGGT CC C GGAT GAGTT CACAGT GGAGGAGAGG CAGACGG GGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTC CAT GC C T C CC GCAGCAACAAC AT GGG CATC GT T GGT GGCTT C ACT CAGAT GAT T C GAGAAGGAGGGGC CAGG 
T CACTCT GGCG GG GCAAT G GCAT CAAC GT C CT CAAAAT T GC C CC CGAAT CAGC CAT CAAAT T CATGGCCTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT C GCCCAGAGCAGCAT CT ACC CAAT GGAGGT C CTGAAGAC C C GGAT GGC GCTG CGGAAGACAGGCCAGTACT CAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
j , GGCAT CAT CC CCTAT G CCGGCAT C GAC CTTGC AGT CTACGAGACGCT CAAGAAT GCCT GGCT GCAGC ACTAT GCAGT G 

? f l AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
Q CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
a?* TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
M GCT GTGAG CAT CAGCTAC GTGGT CTAC GAGAACCTGAAGAT CAC CCT GGGC GT GCAGT C GC GGTGAC GGGG GGAGGGC 

yl CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
j s \ AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
^ GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
%l C TC C GGGCT C ACAT GT GTAAG GACAGGACAT T T T CT GCAGT GCCTGCCAATAGT GAGCT T GGAGCCT GGAGG C C GGCT 

| , | TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
Q ATAATCCAT GAT GAAAGG T GAG GT CAC GTGGC CT CC CAGGC CT GAC TT C C CAAC CTACAGCAT T GACG C CAACTT GGC 

1 TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 

CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
O GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
fl'l CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
7.! ACTGTTGGGAAAAGGGTTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
Q GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
{ -*^ GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
'til AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT 
Q' GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
pij GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
f ^* GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 

AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAG GAAAT C T TCAAT AGGAT GCAAAGAT CAAT GCAAAAAT T GT TATATAT GAACATATAACT GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAATTGGAC GT TAGAAGT T GT CATTTAAAGCAG CCTT CTAAT AAAGTT GTT T C AAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHY LQDHEKKLRLVFKI LDKKNDGRI DAQE IMQSLRDLGVKI SEQQAEKI LKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE I ILYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
II^EGVAAFYKGWPmLGIIPYAGIDIAVYETLKNAWLQHYAWSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGLYRGLAPNFMKVI PAVS I SYV 
VYENLKITLGVQSR 

Important features: 



!*f Signal peptide: 



amino acids 1-16 

Putative transmembrane domains : 

amino acids 284-304, 339-360, 376-394 



Mitochondrial energy transfer proteins signature. 

^ amino acids 206-215, 300-309 



N-glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE SQ 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGAT GGAAT CC TGAGC T GC AC T T T T GAACC T GAC ATC AAAC T T T C T GAT AT 
CGTGATACAAT GGCT GAAGGAAGG T G T T T T AGGC T TGGTCCAT GAGT T CAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
|, f e. CATCACT T CTAAAGGCAAGGGGAATGCTAACCT T GAGT AT AAAAC T GGAGCC T T CAGCATGCCGG 

C! AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
Q CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
f[\ CAGC T T T GAGCTGAACTC T GAGAATGTGACCATGAAGGT T GTGTCTGTGCTC TACAAT GT TACGA 

rj T C AAC AAC AC AT AC T C C T G TAT G AT T G AAAAT G ACAT T G C CAAAGCAAC AG GGG AT AT C AAAG T G 

! r| ACAGAATC GGAGATCAAAAGGCGGAGTCAC C TACAGC T GC TAAAC TCAAAGGCT TC TCT GT GT GT 

Q CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

15 GTGCCT TGG CCACAAAAAAGCATGCAAAGT CAT T GT TACAACAGGGAT C T ACAGAACT AT T T CAC 

r! 

CACCAGATATGACCTAGT TT TATAT TTCTGGGAGGAAATGAAT TCATATCTAGAAGTCTGGAGTG 

yi 

m AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 

W 

f|| C TAT C T T CAAAGACATAT TAGAAGT T GGGAAAATAAT T CAT GT GAAC TAGACAAGT GT GT TAAGA 

Q GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
1 CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 

TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T T C C C AAC T G ACAAAT GC C AAAG T T GAG AAAAAT GAT C AT AAT T T TAG CAT AAAC AG AGC AG T 
CGGGGACACCGAT TT TATAAATAAACT GAGCACC T TC T T TT TAAACAAAAAAAAASAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWS I ISI I IILAGAIALI IGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
D I VI QWLKEGVLGLVHE FKEGKDEL S EQDEMFRGRTAVFADQVI VGNASLRLKNVQLTDAGT YKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS SFFAI SWALLPLS PYLMLK 

Important features: 
M Signal peptide: 

•M amino acids 1-28 

/V| Transmembrane domain: 

%! . amino acids 258-281 

k *« N-glycosylation sites. 

% % amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

AH 220-224 



Ul N-myristoylation sites. 

?*% 

— amino acids 52-58, 126-132, 188-194 

fll 
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FIGURE 61 

T GACGTCAGAATCACCATGGC CAGC TATCCT TACCGGCAGGGC TGCC CAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT TGGT CT T CAT TCAATGAT GAGAC C TGCCTCAT GAT GAT AAACATG T TT GACAAGAC CAAGTCA 
GGC C GCAT CGAT G T C T ACGGC T T C T C AGCCC T GT GGAAAT T CAT C CAGCAGT GGAAGAACC T CT T 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCT T AGAG T GAGAGAAG T AT GT GGAC AT CT CT TCTTTTCCTGTCCCTC TAGAAGAAC 
AT TC TC CC T T GC T TGATGCAACACT G T T CCAAAAGAGGGT GGAGAGTC C T GCAT CATAGCCACCA 
AATAGTGAGGACCGGGGCT GAGGCCACACAGATAGGGGCC T GAT GGAGGAGAGGATAGAAGT TGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGC CAAAAGTC CAGTGAAAT T GT AAGC T T CAAT AAAAGGAT GAAAC TC T GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S GY I SMKE LKQALVNCNW S S FNDS T CLMM I NMFDKTKS GRI DV YGFS ALWKFI QQWKNL FQQYDR 
DRSGS ISYTE LQQALS QMG YNL S PQFTQLLVSRYCPRSANPAMQLDRFI QVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids- 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 



APP_ID=10063730 



Page 216 



FIGURE 63 



CAGGATGCAGGGCCGGGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCGTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 6a 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSW 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myri s toyl a ti on si tes . 

amino, acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATGG GGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 
amino acids 1-19 

r ? ~ N-glycosylation site. 

r\ 

*H amino acids 37-41 

v ' | 

| £ | N-myristoylation sites . 

%l amino acids 15-21, 19-25, 60-66 

0 



:.ru 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGGGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAGG TCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC TCTGCCCCT G AGG T C AAGAG AGG AT G G G G C TAT T C AC T T T TAT AT AT T T AT AT AAAAT TAG 
TAGTGAGATGTAAAAAAAAAAAAA&AAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSMTENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAAT AGAGGATAT G GT GACT ACAGCT TCTAC GTACC TGTT T GAAGCCACAGAAAA&AGA 
TTTT TTT T CAAAAAT GT AT CTAT ATT AAT T CCT GAGAAT T GGAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AAC CATAAACAT GCT GAT GTTATAGTT GCAC CAC CTACACTC C CAGGTAGAGAT GAAC CATACAC CAAG CAGT T CAC A 
GAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
C C AG GC AAACT GT T T G T C CAT G AGT G G GCT CAC CTCCGGTGGG G AGT GT TT GAT GAGT AC AAT GAAG AT C AG C C T T T C 
TAC C GT GCTAAGT CAAAAAAAAT C GAAGC AACAAGGT GTT CCGCAGGTAT CT CT G GTAGAAATAGAGT T T ATAAGTGT 
C AAGGAGGCAGCT GT CTT AGT AGAGCAT GCAGAAT T GATT CT ACAACAAAACT GTAT G GAAAAGATT GTCAATT CTTT 
CCTGATAAAGTACAAACAGAAAAA.GCATCCATAAT GTTTATGCAAAGTATTGATTCTGTT GTT GAATTTT GTAACGAA 
AAAAC C CAT AAT CAAGAAGCT CCAAGC CTACAAAAC ATAAAGT GCAATTTTAGAAGT ACAT GGGAGGT GATTAGCAAT 
T CT GAGGAT TT T AAAAACACCAT ACCC AT GGT GACAC CACCT CCT C CACCT GT CTT CT CAT T GCTGAAGAT CAGTCAA 
AGAATTGT GTGCTTAGTT CTT GATAAGT CT G GAAGCAT GGGGGGT AAGGAC CGCCT AAAT CGAAT GAATCAAGCAGCA 
AAACAT T T CCT GCTGCAGACT GTTGAAAAT GGAT CCT GGGT GGGGAT GGTT CACTT T GATAGTACT G C CACT ATT GT A 
AAT AAGCT AAT C CAAATAAAAAGCAGT GAT GAAAGAAAC AC ACT CAT GGCAG GATT AC CT ACAT AT C C TCT GG GAGGA 
ACTTCCAT CT GCT CT GGAAT T AAAT AT GCAT TT CAGGT GAT T G GAGAGCT ACATT CCCAACT C GAT GGAT CC GAAGT A 
CT GCT GCT GACT GAT GGGGAGGATAACACTGCAAGTT CTTGTATT GAT GAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TT TATT G CTTT GGGAAGAGCTGCT GAT GAAGC AGT AAT AGAGATGAGCAAGATAACAGGAGGAAGT CATTTT T AT GTT 
T CAGATGAAGCTCAGAACAATGGCCT CATTGATGCTTTTGGGGCT CTTACATCAGGAAATACTGATCTCTCCCAGAAG 
T C C CTT CAGCT C GAAAGT AAGGGAT TAACACT GAATAGTAAT GCCT GGAT GAACGACACT GT CATAATTGAT AGTACA 
GT GGGAAAGGAC ACGT T CTTT CTCAT CACAT GGAACAGT CT GC CT CCCAGTAT TT C TCT CT GGGAT C C CAGT GGAACA 
AT AAT GGAAAATTT CACAGT GGAT GCAACTT CCAAAAT GG C CTAT CT CAGTAT TC CAG GAACT GCAAAGGT G GGCACT 
T GGGCAT ACAAT C TT CAAGC CAAA.GCGAAC CCAGAAACAT T AACTATTAC AGTAACT T CT CGAGCAGCAAATT CT TCT 
GT G CCT CGAAT CACAGT GAAT GCT AAAATGAAT AAGGAC GTAAACAGT TT C CCCAGCC CAATGATT GT TTAC GCAGAA 
ATT CTACAAGGAT AT GTACCT GT T CT TGGAGCCAAT GT GACT GCT TTCAT T GAAT CAC AGAAT GGACATACAGAAGTT 
TT GGAAC T TTT GGATAAT GGTGCAGGCGCT GATT CTTT CAAGAAT GAT GGAGT CTACT CCAGGTAT TT TACAGCATAT 
ACAGAAAATGGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCA 
CTGAATAGAGCC G C GT ACATACCAGGCT GGGTAGT GAACGG GGAAATT GAAGCAAACCCGC CAAGACCT GAAATT GAT 
GAGGAT ACT CAGACCAC CTT GGAGGATT T CAGCC GAACAGCAT CCGGAGGT GCATT TGT GGT AT CACAAGT CCCAAGC 
CTTCCCTTGCCT GAC CAAT AC CCACCAAGT CAAAT CACAGACCTT GAT GC CAC AGT TCAT GAGGATAAGATTATT CTT 
ACAT GGACAGCACCAGGAGATAATTTT GAT GT TG GAAAAGT T CAACGTTATAT CATAAGAATAAGT GCAAGT ATT CTT 
GAT CTAAGAGACAGTTTTGAT GAT GCTCTTCAAGTAAA.TACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
AGCT TT GCAT T TAAACCAGAAAATAT CT CAGAAGAAAAT GCAACCCAC AT ATTT ATTGC C ATTAAAAGTAT AGATAAA 
AG C AATT T GAC AT CAAAAGTATCCAACATT GCAC AAGT AACTTT GTTT AT C CCT CAAGCAAATCCTGATGACATT GAT 
C CT AC ACCTACTC CTACT CCTACT C CTACT C CT GAT AAAAGTCATAATT CT GGAGT TAATATTT CT ACGCT GGT ATTG 
T CT GT GATT GGGT CT GTT GT AAT T GTTAACTTTATTTTAAGTACCACCAT T TGAACCTT AAC GAAGAAAAAAAT CT T C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAAT CTTAAAATT CATCCCAT 
GTGT GATCAT AAACT CATAAAAAT AATT TTAAGATGTCGGAAAAGGATACTTT GAT TAAATAAAAACACT CAT GGATA 
T GTAAAAAC T GTCAAGAT T AAAAT TTAAT AGTT T CATT TATTT GTT ATT TTAT TT GTAAGAAATAGT GAT GAACAAAG 
ATC C TTTTT CATACT GATACCT GGTT GTAT ATT ATT T GAT GCAACAGTT TT CT GAAAT GATAT TT CAAATT GCAT CAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVS ILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFGNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVF'SLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTA7ENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
M SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 

m 

~jf I TWNSLP PS I S LWDPSGT IMENFT VDAT SKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 

AANS S VPP I T VNAKMNKDVNS FP S PMI VYAE I LQGYVPVLGANVTAF IES QNGHTEVLELLDNGA 
[|] GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWWNGEIEANPP 
%l RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 

Ul FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 

t*\ 

^ KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 

tx 

p| VNFILSTTI 

m 

hi I Signal peptide : 

^ amino acids 1-21 

Us . 

ill 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 

CT C CT TAGGT GGAAACCCTGGGAGT AGAGTACT GAC AGCAAAGAC CGG GAAAGAC CATAC GT CCC CGGGCAGG GGT GA 
CAACAGGT GT CAT CT TTTTGAT CTCGT GTGTGGCTGCCTT C CTAT TT CAAGGAAAGAC GCCAAGGTAATTT T GACCCA 
GAGGAGCAATGAT GTAGCCACCTCCTAACCTT CCCTTCTT GAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCT CAGAAGTGACTTGCCTGAGGGT GGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTT GGCT GCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCTGCT CTGTGTGGTGGTTAACTCCAAGAGGCAGAA.CTCGTTCTAGAAGGAAATGGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
C GCAACTACGT GAGC AGCCT GAAGCGGCAGATC GCACAGCT CAAG GAG GAGCT GCAGGAGAGGAGT GAGCAGCTCAG G 
AAT GGG CAGTAC CAAGC CAGC GATG CT GCT GG CCT GGGT CT GGACAGGAGC CC CCCAGAGAAAAC CCAGGCCGACCT C 
CT GGC CTT C CT GCACT CGCAGGT GGACAAGGCAGAGGT GAATG CT GGC GT CAAGCT GGCCACAGAGT ATGCAG CAGT G 
C CT TT CGATAGCT T T ACT CTAC AGAAGGTGTACCAGCT GGAGACT GGCCTTACCC GC CACCC C GAGGAGAAG C CT GT G 
AGGAAGGACAAGC GG GAT GAGTT GGT G GAAGC CATT GAAT C AGCCTTGGAGACCCT GAACAAT CCT GCAGAGAACAGC 
C CCAAT CACC GT C CTT ACAC GGC CTCTGAT TT CATAGAAGGGAT CTACCGAACAGAAAGGGACAAAGGGACAT T GT AT 
GAGCT CAC CTT CAAAGGG GACCACAAACACGAATT C AAACG GCTCATCTTATTTCGAC CATTCAGCC CCAT CAT GAAA 
GT GAAAAAT GAAAAGCTCAACAT GGCCAACACGCTT AT CAATGT T AT CGT GCCT CTAGCAAAAAGGGT GGACAAG TTC 
C GG CAGTT CAT GCAGAATTT CAGGGAGAT GT GCATT GAGCAGGAT GGGAGAGT CCATCT CACT GTT GTTTACT TT GGG 
AAAGAAGAAATAAAT GAAGT CAAAG GAATACTT GAAAACACT T CCAAA.GCT GC CAACT T CAGGAACTT T ACCTT CAT C 
CAGCT GAAT GGAGAAT TT T CT CGGG GAAAGGGACT T GATGT T GGAGCC CGCTT CT G GAAGGGAAG CAAC GTCCTTCTC 
TTTTTCTGTGATGT GGACAT CT ACTT CACAT CTGAATTCCT CAATACGT GTAGGCT GAATACACAGCCAGGGAAGAAG 
GTAT TT TATC CAGTTC TTTT CAGT CAGTACAAT C CT GGCATAATAT AC GGC CACCATGAT GCAGT C CCT CCCTT GGAA 
CAGCAGCT GGT CATAAAGAAGGAAACTG GATTTT G GAGAGACTT T GGATTT GGGAT GAC GT GT CAGT AT C GGT CAGAC 
TT CAT CAAT AT AGGT GGGTTT GAT CT GGACAT CAAAGGCT GGGGCG GAGAGGAT GT GCAC CTTTAT CGCAAGTAT C T C 
CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
C TGAC CCC CGAGCAGTACAAGAT GT GCAT G CAGT CCAAGGC CAT GAACGAGGCATCCCACGGCCAG CT GGGC AT GCT G 
GTGT T CAGGCACGAGATAGAGGC T CACCTT C GCAAACAGAAACAGAAGACAAGTAGCAAAAAAAC ATGAACT CC CAGA 
GAAGGATT GT GGGAGACACTTTT T CT TT CCTTT T GCAATTACT GAAAGT GGCT GC AAC AGAGAAAAGACT T C CATAAA 
GGACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGC CT CCGAT TTCT CT CT GTT GGGCT TT TTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATT CAT GATT TAAGAGCAGT TTT GTAAAAAATT CATTAGCAT GAAAGG CAAGCAT ATT TCT CCT CATAT GAAT GA 
GCCTAT CAGC AGGGCT CT AGT TT CTAGGAAT GCT AAAATAT CAGAAGGCAG GAGAGGAGATAG GCT TAT T ATGATACT 
AGT GAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTT CCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGT CT GCT T AT TT AATTAC CACT T T G CAAGC CTTACAAGAGAGCACAAGTT GGC CTAC 
ATT T TTAT ATT TTTT AAGAAGATACTTT GAGAT GCATT AT GAGAACT TT CAGTT CAAAGCAT CAAAT T GAT GCCAT AT 
C CAAGGACAT GC CAAAT GCT GAT T CT GT CAGGCACT GAAT GT CAGGCATTGAGACATAGGGAAGGAAT G GT TTGTACT 
AATAC AGACGT ACAGATACTTT CT CT GAAGAGTATTTT C GAAGAGGAGCAACT GAACACTG GAGGAAAAGAAAAT GAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCT7W^AGTCAGAAACCACATTTT 
CT CCTCAGAAGT AGGGACCGCTT T CT TAC CT GT T TAAAT AAAC CAAAGTAT AC CGT G T GAAC CAAAC AATCT CT T TT C 
AAAACAGGGT GCT CCTCCTGGCTTCTGGCTTC CAT AAGAAGAAATGGAGAAAAATATAT ATAT ATATATATAT ATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
T AACTGAAT T ATT T TT TAAAT TAAGCAGT TCTACTCAAT CACCAAGAT GCTT CT GAAAATT GCATTTTATTACCATT T 
CAAACT ATT TT T TAAAAATAAATACAGTT AAC AT AGAGT GGT TT CTT CAT T CAT GT GAAAAT T ATT AGC CAGCACC AG 
AT GCAT GAGCT AATTAT CT CT TT GAGT C CTT GCTTCT GTTT GCT CACAGTAAACTCATT GTTT AAAAGCTT CAAGAAC 
ATT CAAGCTGTT GGTGT GTTAAAAAAT GCATT GTAT T GATTT GTACT GGTAGTTTAT GAAATTTAATTAAAACACAGG 
C CAT GAAT GGAAGGTGGT ATT GCACAG CTAAT AAAATAT GATTT GT G GATAT GAA 



FIGURE 72 

M^IMVRRGLIAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
EN S PNHRP YT AS DFI EG I YRTERDKGT L YELT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNKAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
T FI QLNGE FSRGKGLDVGARFWKGSNVLL FFCDVDI Y FTSE FLNTCRLNT QPGKKVFYPVL FS QY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVTOTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKT S S KKT 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites . 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T TGTCCTGGGGATCCA 
GAAACCCAT GATACCCT ACTGAACAC CGAAT CC CC TGGAAGCC CACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAAT GCCCAGTCGCCCAT CGAT AT T CAGACAGACAGTG TGACAT T T GACCCTGAT T T GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTGCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T GAAGC C AC AT T T GC AGAG C T C C AC AT T G T AC AT TAT GAC T C T GAT T C C TAT G AC AGC T T GAG T G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AAT ATAGC T TAT GAACACAT TC TGAGTCAC TTGCATGAAGT CAGGCAT AAAGAT CAGAAGACCT C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATT T CAAT GGAACAGCT GGAAAAGCT TCAGGGGACATT GT T CT CCACAGAAGAGGAGCCCT CTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTT CACCTCAGCACAAGCCACGACT GAGGCATAAAT TCCT TCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCT GT GT T GTTAATGCAGAGANCAAACT C T GT T TAGT TGCAGGGGAAGTT T GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C T CC TTAGGATAAAGAGT TGCT GT TGAAGT TGTATAT T T T T GAT CAAT AT AT T T GGAAAT TAAAG 
TTTCTGACTTT 



FIGURE 74 

ML FSALLLEVI WI LAADGGQHWT YEGPHGQDHWPAS YPECGNNAQS P I D I QT DS VT FDPDL PALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

U« Important features of the protein: 

pf Signal peptide: 

01 amino acids 1-15 

I = ? 

t '* Transmembrane domain: 

amino acids 291-310 

W 

ilj N-glycosylation site. 

Ul amino acids 213-216 

S Eukaryotic-type carbonic anhydrases proteins 

fjj amino acids 197-245, 104-140, 22-69 
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FIGURE 7fi 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAZ^CCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

AT GGACTCCACCAGAGGGT C T T CAAGGAGT T AAAG T TAC T TACACT GT GC AG T AT TT CAT CAC AA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGAC CTTCCTGTTTC CAT GCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
Gl CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
% T CT GT GATGGGC T AT TCCAT C T AC CGAT ATAT CCACGT T GGCAAAGAGAAACAC CCAGCAAAT T T 

GATTTTGATTTATGGAAATGAATTTGACAAflAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
Sj ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
ill AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
*„ t AAAACACGGAAGGTAC T TC TC T CAC CCAGCAAGAGT CCC TCAGCAGAACAATACCCCCGGAT AAA 

f|| ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA 
Q GC T C AG T T T GCAGGAGGAGGT G T CCACACAAGGAAC AT TAT TGGAG T C GC AGGCAGCGT T GGCAG 

ffl TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
^ CAGGAGCACACAGAC TCGGAGGAGGG GCCGGAGGAAGAGCCAT C GAC GACCCT GGTCGAC T GGGA 

rif 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAAT TCAT GGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FW YVLP I S I T VFL FSVMGYS I YRY I HVGKEKHPANL I LI YGNE FDKRFFVPAEK 
I VINFI TLNI SDDSKI SHQDMSLLGKS SDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGGATCGGGC 
T GATCAATGAAAAGGC TGCAGATAAGCT GGGATG TACCCAGAT CGT GAAGATCCT AAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACT TATACTCAACT TGAATAACAT CAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



APP ID=10063730 



Page 231 



FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKHI IWLKVT TANILQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT IVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVTEAS FNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KW FNNSAAS LTMP T LDNI P FS L I VS QDWKAAVAAVL S PEE FMVLL DS VLPE S AHRLKS S I GL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FE AAE SSL TKDALVL T PAS LWKPS S PVS Q 

Important features of the protein: 
Signal peptide : 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTAGGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTC.TCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GC TAGT G T AAC AGC ATGC AGGT T GAAT T GCC AAGGAT GC TCGCCAT GC CAGC C TTTCTGTTTTCC 
TCACCTTGCTGCTGCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTAGCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACT AAT T T GT GCAT GAAC TGAAATAAAACCAT CC TACGGTAT CCAGGGAACAGAAAGCAGGAT G 
CAGGAT GGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYI LGLLGLLGTLVAMLLPSWKTS S YVGAS I VTAVGFSKGLWMECATHS TGI TQCD 
I YS TLLGLPAD I QAAQAMMVTS S AI SS liAC 1 1 S WGMRC TVFC QE SRAKDRVAVAGGVFF I LGGL 
LGFI PVAWNLHGI LRDFYSPLVPDSMKFE I GEALYLGI I SSLFSLI AGI I LCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 



APP_ID=1 0063730 



Page 234 



FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCT.GGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 

ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACG(SGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGACCCAAGGGAGCAGGCAGCACAACTGT CAAGATCGTCC TGAAGGAGAAACATAAGAAAGCCT 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

CTCGGACT TGGT GGAGAT C T ACCT CT GGAAGC TGGTAAAAGAT GAGGAAAC TGAGGC TCAGAGAG 

GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 

GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 

GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 

ACATAACAAAGACCTAACAGTTGCAGATATGAGCTGTATAATTGTTGTTATTATATATTAATAAA 

T AAGAAGT T GCAT TACCC TCAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 82 

MVPEVRVLSSLLGIiALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 82 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTT CT CT AGCGTGGT CC ACCT CT AC AGGG ACGGGAAGGACCAGCC AT TT AT GC AGAT GC CACAGT AT C 
AAGGCAGG AC AAAACT GGTG AAGG AT TCT ATT GCGGAGGGGC GC AT CT CT CT GAGGCT GGAAAAC AT T ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GT C AGC ACTGGGCT C AGT TC CTCTC AT TT C CATC ACGGGAT AT GT T GAT AGAG AC ATC C AGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AGAC TC C AGG AC AAAC AG AGAC AT GC AT GGCCT GTT TGAT GT GG AG AT CT CT CT GACC GT CC AAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC ACT CT GAG AAGAGAT T TAC AAG GAAG AGT GTGGT GGCT TC TCAG AGT TTCCAAGC AGGG AAAC ATT A 
CT GGGAGGT GGACGGAGG AC ACAATAAAAGGT GGC G CGT GGG AGTGTGCC GGGATGATGT GGAC AGGAGG A 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAAT GTAGG ATGAATCACATCCCACAT 
T CT T CT TT AGGG AT AT T AAGGT CT CT CT CC CAGAT C CAAAGT CC CGCAGC AGCC GGC CAAGGT GGC TT CCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GAC ATT ACAT TT AGT T T GCT CT CACT CC AT CT GGCT AAGT GATCTT GAAAT ACC ACCT CT C AGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCT T AT TGATGAC AG AGT GTATCC T AATGGT T T GT TC ATT ATATT AC AC TTT C AGT AAAAAAA 
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FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGS I SC SMRHAHLSRE VE SRVQ I GDT FFE P I S WHLATKVLG I LCCGLFFG I VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PI VI CPVTQE S EKEAS WQRAS AI PET SNSE S S S QATT PFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATCCTGCTGCTGCTGCTGCCCCT 

GCT CT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGAT GCAGAGT TCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 

TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCTTGGGACTGGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 

ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 

GAGGCTGATTCTTGTAGAATTAACAGCCGTCAACGTGATGAGCTATGATAACACTATGAATTATG 

TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 

T CCC T T T TAT T T T TT TAAC TAAAAGAC AGACAAAT T CC TA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CC AAGGAAAGT GC AGC T GAGAC T C AGAC AAGAT T ACAATGAACCAAC TC AGCT TCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCT TTGGATCTGCAGAGGCGGCCACGAGCGAT GACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
G T T T GGCATC TACCAGAAATATCCAG TGAAATATGGAGAAGGAAAGT GTT GGAC TGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGAT TTGT TCAGTTCAGGGTAT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTT TCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTT TTCT GGT TT TGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAAC T TACCCAGTAGCTAGAAT GT TAAT GGCAGAAGAGAAAACAAT AAAT CATAT TGAC TCAAGA 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDETVNTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKMYPEGDGNWANYNTFGSAEAAT 
S DD YKNPGY YD I QAKDLG I WHVPNKS PMQHWRNS S LLRYRTDT G FL QT LGHNL FGI YQKY P VKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGGY FPEAS PQQCGDFS GFDWSGYGTHVGYS S SRE I TEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site, 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T T AAGACT TATATACAG T T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE go 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 

TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 

GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 

CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 

GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 

CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 

GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 

CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 

AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 

GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 

CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGGCAGGGCCAAGTCTCAAGTGGCAG 

AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CC T T GGT T CTT GCCATCC TGAGGAAAGAT AGCAACAGGGAGGGGGAGAT T T CATCAGT GTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 

CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 

CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 

CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 

GAGAGGGGTCCGTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 

CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 

TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKA.VEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 



Important features of* the protein: 
Signal peptide: 

amino acids 1-14 



f H N-glycosylation sites. 

ill amino acids 62-65, 127-130, 137-140, 143-146 

%\ 

^ 2-oxo acid dehydrogenases acyltransferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGAT GGAC CAACACAGAAATAT CT GC TGAT C T T TGGAGCGT T TGTCTCTGTC TAT ATCCAAGAA 
ATGTTC CGAT T TGCATAT TATAAAC TCT T AAAAAAAGCCAGTGAAGGTT T GAAGAGTATAAACC C 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
^ GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
^ ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
m TC C T GAC CCACCT GC TGGTGTCAGCCCAGACCT TCATAAGT T CT TAT TAT GGAATAAACC TGGC G 

y| TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC TC TGCC T GCT CTGC CAAGACAAGAAC TTTCTTCTT T ACAACCAGCGCT CCAGAT 
AAC C TCAGGGAACCAGCAC TTCCCAAACCGCAGAC TACATCT T TAGAGGAAGCACAAC TGT GCCT 

CI 

T T T T CT GAAAATCCC T T T T TC TGG T GGAAT T GAGAAAGAAA.TAAAACTAT GCAGATA 

us 

ill 
ru 
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FIGURE qa 

MT AAVFFGCAF I AFGPALAL YVFT I AI E PLRI I FL IAGAFFWLVSLL I S S LVWFMARVT IDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT FI S S YYG INLAS AFI I LVLMGTWAFLAAGGS CRS LKLCLLCQDKNFLL YNQRS R 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 



AAT T T T T C AC C AG AGT AAAC T T G AG AAAC C AAC T GG AC C T T GAG TAT T G T AC AT TTTGCCTCGTG 
GAC CCAAAGGTAGCAATC TGAAACATGAGGAGTAC GAT TC TACT GT T T TGTCT T CTAGGATCAAC 
T CGG TCATTACCACAGC T CAAA.CCT GCT T T GGGACT CCC T CC CACAAAACTGGC TCC GGAT CAGG 
GAACAC TACCAAACCAACAGCAGT CAAAT CAGGTCT TT CC T TC T T T AAGT C TGATAC CAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CC AGACC CACCCAT TGACCCT GGGAGGGT T GAATG T ACAACAG CAAC TGCACCCACATGT GT TAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
% TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
%l GAACCCCAGCAGGCCGCCTCCCAACT CC CAGT GGCACAGATGACGAC T T T GCAG TGACCACCCC T 

yj GCAGGCATCCAAAGGAGCACACAT GCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAAT TCA 

S\ GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TATGGAAT AGAT T GAGACACAT TGGATAGTCT TAGAAGAAAT TAAT T CT TAAT T 
TAC C T GAAAAT AT T C T T GAAAT T T CAGAAAAT ATG T TCT AT GT AG AGAAT C CCAAC T T T T AAAAA 
C AATAAT T CAATGGATAAATCTGTC T T T GAAATATAACAT T AT GCTGCCTGGAT GATATGCATAT 
T AAAACATAT T TGGAAAACTGGAAAAA?WIAAAAAAAAAAAAAAAAAAAAAAA^ 
Q AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

m 
ci 
in 
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FIGURE q6 

MR.STILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDD FAVT T PAG I QRS THAI EEATTE SANG I Q 

Signal peptide: 

amino acids 1-16 



SI 

jiBSS 

; y 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGAT GAGCCAGGCACAGCCAAC TCGGAT C T TCGAT T CCACAT CCTGAGC CAGGC TC CAGC C CAGC 
U CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGGCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CP CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
U| TGCAGAGGGAAAC C T CTACGT GACCAGAGAGC T GGACAGAGAAGCCCAGGCTGAGTACCT GCTCC 

Cj AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
. ]\ ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
*M CAGTCGACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
CI ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
« CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCT.TCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
f£ TCGAAGTCGCAGTCACAGATAT CAAT GAT CACGCCCC TGAGTT CAT CAC T T CCCAGATTGGGCCT 

ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
kA CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
f|! GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
f% GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCGAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT TT TCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLFAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH II PWVS HNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVGI LVGTLVAI G I FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T CT T CCCAGGAACACAAACGT AGGAGAC CCAC GC TCC T GGAAGCACCAGCC T T T A 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTTGCTT TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAAT G 
T TC T CC T TAT G T T TGGTC TAC TAT T GCAT T TAGAAGCT GCAACAAAT TCCAATGAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACGTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TT CAGCACAGCGTCCAGTGGGAT CAGCAT AGCCACCAAC TCT GAGT CCAG CACAACC T CCAGT GG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCAC T GCCACCAAC T CTGAGT C TAGCACAC TC TCCAGT GGGGCCAGCACAGCCAC 
CAACT C T GACT CCAGCACAACC TCCAGT GGGGCTAGCACAGCCACCAACT C T GAGTCCAGCACAA 
O CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
0! GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
hi . AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 

Ti GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
/I TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
^ GGCTAGCACAGCCACCAAC TCT GACT CCAGCACAACC TCCAGT GGGGCCGGCACAGCCAC CAACT 

£3 CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
a AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
?K CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
Wj GCCAC CAACT C T GACTCCAGCACAACCT CCAGT GAGGCCAGCACAGCCAC CAAC TCT GAGTC TAG 

Q CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
n| ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
5J| GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
* ^ TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCC TAAC T GG T TC TGGAGGAGACCAGTAT CATC GATAGCCAT GGAGATGAGCGGGAGGA 
ACAGCGGGCCC TGA GCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
C CCAAGACCT GGT T T CC T T TCAT T CAT CC CAGGAGAC CCC TCCCAGCT TT GT TTGAGATCC T GAA 
AAT CT TGAAGAAGGTAT TCC T CACC T TTCTTGCCTT TACCAGACAC T GGAAAGAGAATAC TAT AT 
TGCT CAT T TAGC T AAGAAAT AAAT ACAT C T CAT C T AAC AC ACAC GACAAAGAGAAGC T G T G C T T G 
CCCCGGGGT GGG TAT C TAGC T C T GAGATGAAC T CAG T TATAGGAGAAAACC T CCATGC TGGAC T C 
CAT CT GGCAT T CAAAATC T CCACAGTAAAAT C CAAAGACC TCAAftAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTV.TNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSE S S TT S S GVS TATNSE S ST T S S GAS TATNS DS S TT S S EAS T ATNSES STVS S G I S TVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNT FNTAVYHPHG.LNHGLGPGPGGNHGAPHRPRWS PNWFWRRPVS S I 
AMEMSGRNSGP 

Signal peptide: 



'•M amino acids 1-20 



fH 



Transmembrane domain: 

Ml. amino acids 510-532 



rfJ 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGC T G GAACTAT CAGGC TGAAAAACAGAGT GGGTAC TCTCTTCT GGGAAGC TGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TCTAGC TACT GCACGTACT TCATAAAGCAGGACT CT AAAAGC T T T GGAATCAT 
GGTG TCAT GGAAAGGGAT T TAC TT TATACT GAC TC T GT T T TGGGGAAGCT T T T T T GGAAGCAT T T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
M CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

W GAT T AT AAC T GGGGAT GCAT TTGTTCCTG GAGAAAGAAG T G T CAT TAT CAT GAAC CAT C G GAC AA 

r'% 

'% GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 

\,h 

1 5 1 TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
%\ CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATT TCGAAGACATGAT TGAT TACT TTTGT G 

111 ATAT TCACGAACCAC T TCAACT CCT CATAT TCCCAGAAGGGAC T GAT C T CACAGAAAACAGCAAG 

T CT CGAAGT AATGCAT T T GCT GAAAAAAAT GGAC T T CAGAAAT AT GAATAT GT T T TACATCCAAG 
AACT ACAGGCT T TAC TT T TGTGG T AGACC GTC TAAGAGAAGGT AAGAACC T T GAT GCT GTC CAT G 
\ M ATAT CACT GT GGCGTATCC TCACAACAT T CC T CAAT CAGAGAAGCACC TC C T CCAAGGAGAC T T T 



CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCT T CAAC TCTGGT GCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCT GCGT TC C T TC TAT CAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATAT T T GGTGGAC TGGAGAT CATAGAAC T TGCATGT TAC CGACT T T TACACAAACAGCCACAT 
T T AAAT T CAAAGAAAAATGAGTAAGAT T AT AAGG T TT GCC AT GT GAAAACC TAGAGCATAT T T TG 
GAAAT GT T CT AAACC T T T C TAAGC T C AGAT GCAT T T T TGC AT GAC TAT GT CGAATAT T TC T TAC T 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAATCAT CAGGCT T T TAAA 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKS FGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEPCNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLL I YL YS LVKWYFI I T I VI FVLQERI FGG 
LEIIELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 



APP ID=10063730 



Page 256 



FIGURE IPS 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 

T CCAAAT CAT CCATCCACCCC TGC T GT CATC T GTT T TCATAGT GT GAGAT CAACCCACAGGAATA 

TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 

CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 

TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 

CACC TCTACAGAGAT GGGGAAGAC TGGGAAT C TAAGCAGATGCCACAGTATCGAGGGAGAACTGA 

GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 

ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 

CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 

GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 

AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 

ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 

GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 

TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 

AAAT CCAAAGGGAAAAT CCAGGCGGAACT GGAC TGGAGAAGAAAGCAC GGACAGGCAGAAT TGAG 

AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 

TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 

TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 

CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 

ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 

ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 

GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 

TGACAT GT CAGT T TGAAGGCT TGT T GAGACCCTATATCCAGCATGC GATG TATGACGAGGAAAAG 

GGGAC T CCCATAT T CATAT GT CCAGT GTC CTGGGG ATGAG ACAGAGAAGAC CC T GC T TAAAGGGC 

C CCACACCACAGACCCAGACACAGCCAAGGGAGAGT GCT CCCGACAGG TGGCC CCAGC T T CC TC T 

CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 

TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

T CAGAAGCCAT GGC TGCCC TGAAGTGGGGACGGAATAGAC TCACAT TAGG T T TAGTT TGT GAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATGT GAAT CATGCT TGCAGGT T 

T GAGGGCACAGT GT T TGC TAAT GAT GTGT T TT TATATT ATACAT T T T CC CACCATAAAC TC T GT T 

T GCT TAT T CCACAT TAAT T TACT TT T C T C T ATACCAAATCACCCAT GGAATAGT TAT TGAACACC 

TGCTTTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCAT 

TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 

C TCAT TAACACAGACACAAAAAT TCTAAATAAAAT T T TAACAAAT TAAACT AAACAAT ATAT T TA 

AAGATGAT AT ATAAC TAC TCAGT GT GGTT TGT CCCACAAAT GCAGAGT TGGT T TAATAT T TAAAT 

ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAA7VAAAAAAAAAAA 



FIGURE 1Q4 

MAFVL I LVLS FYELVS GQWQVTGPGKFVQALVGEDAVFS CS L FPET SAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS IHLAEQSHEVESKVL I GET FFQPS PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 131-150, 235-259 
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j 5 5 



Q 

CI 

f 1 1 



CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TG AC AAACT AT ATGCTGAGT TT GG CAGAGAGG CT TCT AAC AATT TT ACAG AAATGAGCC AGAGACTTG AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGT AG AT AAAATTGT TC AACT T GTT T T AC AT GAAAAGCT GC AAGAT GCT GT AGGACCCCCT AAAGTAG 
AT CCTC ACT C AG TT AAAATT AAAAAAAT CAAC AAGACAGAAAC AGACAGC T AT CTAAACC AT T GCTGC GGA 
AC ACGAAG AAGT AAAACT CT AGGT C AGAGT CT CAGGAT CGTT GGT G GGAC AG AAGT AGAAGAGGGTGAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCTT CGAAAATGAAACGGGGTCT C CGGAG AAT AATT GT CC AT GAAAAAT ACAAAC ACC CAT C AC A 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 



I s i TCCCT G AT GC AT CCT AT GAGTTT C AACCAGGT GAT GTG AT GTTT GT GAC AGG AT TT GG AG CACT GAAAAAT 

Q' GATGGT TAC AGT CAAAAT C ATCTT CG ACAAGC AC AGGT G ACT CT CAT AGAC G CT AC AACT TGCAAT GAAC C 

% TC AAGCTT AC AATGAC GC CAT AACTC CT AG AAT GTT AT GT GCTG GCTCCT T AGAAGGAAAAAC AGATGCAT 

U?$ GC C AGGGT GAC T CTGGAGGACCACTGGTT AGT T C AG ATGCT AG AGATAT C TGGT ACCT TGC T GGAAT AGT G 

AGCT GGGG AGAT GAAT GT GC GAAACC CAAC AAGCCT GGT GTT TAT ACT AGAGT TAC GGCCT T GCGGG ACT G 
G ATT AC TT C AAAAACT GGT AT CTAA GAGAC AAAAGC CT CATGGAAC AGAT AAC AT TTTTTTTTGTTTTTTG 
GG TGT GGAGGCC AT TT TT AG AG AT AC AGAATT GGAG AAGACT TGC AAAAC AGCT AG AT TT GACT GAT CTC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GT CAT C TGTG AGCAAT AGT T GAAACT TT ATGT AC AT AGAG AAAT AGAT AAT AC AAT AT TAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TC AAGGT GC AGAAC AAGG AGTG AAAGAAAATAT AAGAAGAAAAAAATCCC C TAC AT TTT ATT GGC AC AGAA 
AAGT AT TAGG TGTT TT T C TT AGT GGAAT AT T AG AAATG ATCATATT C ATT ATGAAAGGT C AAGC AAAG AC A 
GC AG AAT AC C AATC ACT T C ATC AT TT AGGAAGT AT GGG AACT AAGT T AAGGAAG TCC AGAAAGAAGCC AAG 
AT AT AT CCTTATTTTCATTT CC AAAC AACT ACT AT GAT AAAT GT GAAG AAGAT T CT GT T T TT TTGT GACCT 
AT AAT AAT T AT ACAAACT T C ATGC AATGT ACT T GTT CT AAGC AAAT T AAAGC AAAT ATTT AT TT AACATT G 
T T ACTGAGGAT GTCAACAT ATAAC AATAAAAT AT AAAT CACCCA 
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MMYRPDWRARKRVCWEPWVI GLVI FI SL I VLAVC I GLTVHYVRYNQKKT YNYYS TL S FTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTT CTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGT CATT T TGACTTACTT TCCT . 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
L„% T GC T T GGAAT T T TT T TCAGAGAGAGT T T AAGTGCT GT GGAG TAG TAT AT T T CACTGAC TGGT TGG 

P« AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
W T TT GAGAG GAACCAAACAAC T GCAGGT GC T GAGGT T TCT GGGAATC T CCAT T GGGGT GAC ACAAA 

y! TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ill ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
Tl GT T GAAACCAAGCCT GT CAAGAATCTT TGAACACACATC CAT GGCAAACAGCT T TAATACACACT 

T T GAGAT GGAGGAGT TATAAAAAGAAAT GTCACAGAAGAAAACCACAAACT T GT T T TAT TGGACT 
U- TGT GAAT T T T T GAGTACATACT AT GT GT T T C AGAAATAT GTAGAAATAAAAAT GT TGCCATAAAA 

Q TAACACCTAAGCATATAC TATTCTAT GCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC 

2 ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
m TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
J ? ;{ TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTTCTACCAA 
US C TAGT ATAT AAAGTAC TAATTAAAT GCT AACATAGGAAGT TAGAAAATAC T AATAAC T T T TAT TA 

O CTCAGCGAT CTATT C T TC T GAT GCTAAATAAAT TAT AT AT CAGAAAAC T T T C AAT AT T GGTGAC T 

ACC T7\AAT GT GATT T TTGCTGGT TACTAAAATAT TC TTACCAC T T AAAAGAGCAAGC TAACACAT 
P\ TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
J! T TCAGT T C TGATAATGT TAAGAATAACCAT TATGAAAAGGAAAAT T T GTCCT GTATAGCATC AT T 

f9 ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

T GT TAT T TAAATAC T TAACCAC T AAT T T T GAAAAT TACCAG TGTGATACATAGGAAT CATTAT T C 
AGAAT GTAGT C T GGT CT T TAGGAAG TAT TAATAAGAAAAT T TGCACAT AAC T TAGT T GAT T CAGA 
AAGGACT T GT AT GCTGTTTTTCT CCCAAAT GAAGAC TCT T T TT GACACTAAACACT T T T TAAAAA 
GCT T AT CT TT GCC T T C TCCAAACAAGAAGCAATAGTC T CCAAG T CAAT AT AAAT TC TACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTTTGTTT TAT T T CAC T GAT T AAT AT AC T G T G G C AAAT T AC AC AG AT TAT T AAAT T T T T T T AC AA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T T C T CAGAAT AT GGAAAGAAAAT T AAAAT G T G T CAAT AAAT AT T T T C T AG AGAG T AA 
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■ m 



MAREDSWCLRCLLYALNLLFWLMSISVIAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPWHP 
VMI AVCCFLI I VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQI LAMI LT I TLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATG CCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACCACCAGTTGGAT GAGTCT CAAGGTCCTCCT CT AT GTGAC AACC ATGTGAAT GGGGAG 
T GGT ACCACTT CAC GGGCAT GGC GGGAGAT GCC AT GCCT ACCT TCTGC AT AC C AGAAAAC CACT GTGGAACCC A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CT GAC C AAGCC C AGC GT C T GCT T CCAC GT CT ACT GT GGT CAT T TT T AT GAC AT C T GC GAC GAGGACT GCCAT GG 
CAGCTGCTC AGAT ACC AGC GAGT GCAC AT GC GCTC CAGGAACT GT GCT AGGC C C T G AC AGGC AGACAT GCTTT G 
AT GAAAATGAATGT GAGCAAAAC AACGGT GGCT GC AGT GAGATCT GTGT GAACCTC AAAAACTC CTAC CGCTGT 
GAGTGT GGGGT TGGCCGT GTGCT AAGAAGTGAT GGC AAGACTT GT GAAGACGTT GAAGGAT GCCACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGATAACCACACTT GCCAAGTCC CT GT GTT GTGCAAATC AAAT GCCAT TGAAGTGAAC AT CCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGCC GAAAT C AT GGG AT CT T CCC AT T C ACTC T GGAGATC T T C AAGGAC AAT G AGTT T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTT GGT GGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGT CCT GAAAT ACT ACCTC AT 
r|| CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCT GTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT C AGAAGC T G GGT AT AAT AT T T C AAGTT AC AAAC CC T AGAAAAAT T AAACAGT T AC T GAAAT TAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT ATAGGGAAT TT GGAAGT GT ATCAAT AAAACAGT AT AT AATT TT 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CE FPRL YT I SEGYVPNLRNS PLE IMSRNHGI FPFTLE I FKDNE FEEPY REALPTLKLRDSLY FGI E PWHV 
SGLESLVESCFATPTSKIDEVLBCYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 



01 Signal peptide: 

UJ amino acids 1-16 



•sM N-glycosylation sites. 

r"4' 



amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 



lJ N-myristoylation sites. 

f|| amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGG CAGC AGCTTG CT CAGCGG AC AAGGAT GC TGGGCGT G AGGGAC CAAGGCCTGCC C TGCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GC CTTGGGGTGAC AATCT CAGCT C CAGGCT AC AGGG AG AC CGGGAGGAT C ACAG AGCC AGC ATGT 
TACAGG AT CCTGAC AGT GAT C AACCT CTG AAC AGCCT C GATGTC AAACCC CT GC GC AAACCCCGTATCCC C 
AT GGAGACCTTCAGAAAGGT GGGGAT CCCC AT CATC AT AGC ACT AC TGAGCCTGGC GAGT AT CATC AT T GT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
C AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
2^ GCT ACAGCAG AGCT GTGGAGATTGGC CC AGACC AGG AT CT GG AT GT TGT T GAAATC AC AG AAAACAGC C AG 

fll GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
fl I TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
%j TC AGCATC CAGT AC G AC AAACAGC AC GTCT GT GGAGGGAGC ATC CT GG AC CCCC ACTGGGTCCT C ACGGC A 

y| GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
fij CTTCCC ATCCCT GGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGT ACCCCAAAGACAATGACA 

% TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
■W GATGAGGAGC TCACTCC AGC C ACCCC ACT C TGGATC AT TGGATGGGGCTT T ACGAAGCAGAAT G GAGGGAA 

? G ATGTCTGAC AT ACT GCT GC AGGCGT C AGT CC AGGT CATT GACAGC ACACGGT GC AAT GC AG AC GATGCGT 

a 

;::! ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
2 GACAGT GGTGGG CCCCTGAT GT ACCAATCT GAC C AGTGGC AT GT GGT G GG CATC GT T AGCTGGGGCT ATGG 

»3i CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 

i U 

GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
T T CT T GGAGC AGC AAAGGGCCTCAAT TCCT GT AAGAGACCCT CGC AGC CC AGAGGCGCCC AG AGGAAGTC A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGT ATT GC T AAGCCAAGAAGG AACT T TCCC AC ACT AC T GAATGG AAGC AG GCTGTC TT GT AAAAGC C 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT ACT AGAGC AAGAAACC AGTT GT AATATAAAAT GC ACT G CCCT ACTGT TGG T ATGAC T AC CGT T ACCT 
ACTGT T GT C ATT GT T ATT AC AGCT AT GG CC ACT AT T AT T AAAGAGCTGTGTAAC AT CTCT GGC AAAAAAAA 
AAAA 
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MLQDPDS DQPLNSLDVKPLRKPRI PMET FRKVGI P 1 1 IALLSLAS 1 1 1 WVL I KVI LDKY YFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEAIAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GS FP SLAVAKI 1 1 1 E FNPMYPKDND I ALMKLQFPLT FSGTVRP I CLPFFDEELT PAT PLW 1 1 GWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATG CTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCT^AACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAG TGGCAGGGAAGGAAC T TGT GCCAAAT TATGGGTCAGAAAAGAT G 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCT.GCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GAT CTCCAAGGGCCCT TATACCCCAGGAGAC T T TGAT T T GAAT T T GAAACCCCAAATCCA 
AACCTAAG7\ACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TACTCGGGAGGC TGAGACAGGAGAAT TAC T TGAACC TGGGAGGT GAAGGAGGC T GAGACA 
GGAGAAT CACT T CAGCC TGAGCAACACAGCGAGAC TC TGT C TCAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPS PT VSRLEAL TRAVQVAE PLGS CGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE lis 



CAGCAGTGGTCTCTGAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACC ATG GCAA 
AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAATTGATCCTGTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AG AAGAGGAAAT AG AT G AG AAT G AAG AAAT T ACC AC AAC T T T C T T T GAAC AG T C AG T G 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
ACT T T GAGGAGGAGGGAGAAGATC T T CAC TT TCCTGCC AACGAAAAAAAAGGGATT GAACAAAAT 
GAACAGT GGG T GGT CCC TCAAGTGAAAGTAGAGAAGACCCGT CACGCCAGACAAGCAAGT GAGGA 
AGAACT T CCAATAAATGAC T AT ACT GAAAAT GGAATAGAAT T T GAT CCCAT GC TGGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
T7VACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAZUUUUUUWUWW^ 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I T TT FFEQSVI WVPAEKP I ENRDFLKNSKI LE I CDNVTMYW INPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 



amino acids 1-40 

r! 

m 



Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 



n 
Lfi 

Q Glycosaminoglycan attachment sites. 

ffi amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 

U 242 

i y 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCT CCCCTC AGGAGCGCGTT AG CT TC AC ACCT T C GG CAGCAGGAGGGC GGC AGC TT CTCGCAGGCGGC A 
GGGCGGGCGGCCAGGATC ATGT CCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CC GT GT TCC AGT AC GAAGGGCTCT GGAGGAGC T GCGTG AG GC AGAGTT C AGGCT TC AC CG AATGCAGGCCC 
T ATT TC ACC ATC CT GGGACTTCCAGC CAT GCT GCAGGC AGTGCGAGCCCT GATG AT CG T AGGCATCGT CC T 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
M GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
y TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
# :!! GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 

I si 

Til TGGGT C C AAC ACCAAAAAC AAG AAGAT AT ACG ATGG AGGTGCCC GC ACAGAGGAC GAGGT ACAAT C T T AT C 

if'! CTTCCAAGCACGACTATGTG TAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
I A CCC AAAAAAC AAGGAGAT CCC ATCTAGAT T TCTTCT TGCT TT TG ACTC AC AGCT GG AAGT T AGAAAAGCC T 

f:| CGAT TT CAT CTT TGGAGAGGCC AAAT GGT C TT AGCC TCAGTCTCT GTC TCT AAATATTCC AC CAT AAAAC A 

;| GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
Q . TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 

lU TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
M GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTT AC TG AAGAAG AAGC AAT AAG AG AAAGAT AT TT GT AATCTCT CC AGCCC AT G ATCTC GGTTT T CTT 
J 5 ! ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT T G TCCC T AGAT G AATG AGAAAATT AT TT TT TT T AAT T T AAGT CCT AAAT ATAGT TAA 
AAT AAAT AAT GT TT T AGT AAAATGAT AC ACTATCTCTGT GAAAT AGCCTC ACCC CT AC AT GT GGAT AG AAG 
GAAAT GAAAAAAT AAT TGCT TT GAC ATTGT CT AT AT GGT ACT TT GT AAAGTC AT GCTT AAGT AC AAAT TC C 
AT GAAAAGCT CACACC TG TAAT CCT AGC ACTTT GGG AGGCTGAGGAGGAAGG AT CACTT G AGCC CAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGG GCTGCAGT GAGC CATG AT CACACC ACTG CACT CC AGCC AGGTGACAT AGC GAGAT C CT GT CT AAAAA 
AATAAAAAAT AAAT AATG GAAC AC AGCAAGTC CTAGGAAGTAGGTT AAAACTAATT CT T T AA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANMLVTNFWMS TAM4YTGMGGMVQT VQTRYT FGAAL FVGWVAGGLTL IGGVMMCIA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT C AGGAT GC AGT G CAAAAT C TAT GAT TCCCTGCTGGCTCTTTC T CC GGAC C TAC AGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGT GC ACG GGG GACAAT GAGAAG GT GAAG GCT CAC AT T C T G C TG ACG G C T G GAAT C AT CT T CAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACT CAATAGT GAATGT T GCCCAAAAACGT GAGCTT GGAGAAGCTCTCTACT TAGGAT GGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
C AGAT AC T C GAT ACC T T CC CAT C GC AC AAC CC AAAAAAG T TAT CACAC C GGAAAGAAGT C ACC GAGC G 
TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CT TAAT TACAGGAACT GTGCATCAGCTAT T TAT GAT T CTATAAGCT AT T T CAGCAGAATGAGAT A 
T T AAACC CAAT GCT T T GAT T GT T C TAG AAAGTATAG TAAT T T GT T T TC T AAGG TGGT T C AAGCAT GT A 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCAT TAT GT ACAT AGAT GAGT GTAACAT TT AT AT C T C ACATAGAG AC AT GCT TAT AT GGT 
TTTATTTAAAAT GAAAT GCCAGT CCAT T ACACT GAATAAAT AGAACTCAACTATT GCTTTTCAGGGAA 
ATCAT GGAT AGG G T T GAAGAAGG T TAC TAT TAATT GT T T AAAAAC AGC T TAG GGAT TAAT GT CC T C CA 
TTTATAATGAAGATTAAAAT GAAGGCT TTAAT CAGCATT GT AAAGGAAATT GAAT GGCT T T CT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
T TC CAGGGC T AT ACT CAGAAGAAAGAT AAAAG T GT GATC T AAGAAAAAGT GAT GG T T T T AGGAAAGT G 
AAAATAT TTT TGTTT T T GTATTT GAAGAAGAATGAT GCAT T TTGACAAGAAAT CATATAT GTATGGAT 
ATATT TTAATAAGTATT TGAGTAC AGACT TTGAGGT T TCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TAT AT TAAT AAATTGT ACAT T TTT CTAATT 
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FIGURE 120 

MATHALE I AGLFLGGVGMVGTVAVT VMPQWRVS AFIENNI WFENFWEGLWMNCVRQAN I RMQCK 
IYDSLIALSPDLQAARGIMCAASVMSFLAFMl^ 

MVVLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYS I PSHRTT QKS YHTGKKS PSVYSRS QYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGCT T TGAGGAGTC CTGGACACC CAAC TAC 
M AAGCAGT GT T CATGGAGT TCAT TGAATT AT GGCATAGAT C T TGGGAAAAT T GCGGAG TGTACAT T 

Q TACAAAGAT GCGT TCAAATAGTGCTCTAAGAGT TTTGTTCAGT GGCT CACT TCGGCTAAAATGCA 

y GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
T\ ATT GAAGCTATAAT TTAT TTGGACCAAGGAAGCCCTGAAATGAATT CAACAATTAATAT TCATCG 

■V| . CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
l|| TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
W ATTATTGAAGAACTACCAAA ATAAA TGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
^ T T G GAAT G G T T C AC T T AAAT GACAT T T T AAAT AAG T T TAT GT AT ACAT C T G AAT GAAAAGCAAAG 

fjj CT AAAT AT GT T TACAGACCAAAGT GT GAT TTCACAC TGT T T TT AAAT CT AGCAT TAT T CAT T TT G 

f'% CT TCAAT CAAAAGT GG TTT CAATAT T TT T T T TAGT TGGT TAGAATAC TT T CTT CATAGTCACATT 

01 CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
fl 



ry 



AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 
TAAATAAAAAT TAT TTCCAACA 
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FIGURE 122 



MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASE I PKGKQKAQLRQRE WDLYNGMCLQGPAGVPGR 
DGSPGA3WIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 



APP_ID=1 0063730 



Page 276 of 322 



FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 

CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 

ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 

ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 

TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 

TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 

TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 

GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCT TTGAACAATACAAAGG ATG GGT TTCAATG 

TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 

CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 

AT CT CAGAAAT T ACAGGAGATACC C T C AAGT ATAT CTGC TGGT TGC T TAGG TT T GTCCCT T CGCT 

ATAACAGCCTTCAAAAACT TAAGTATAAT CAAT TTAAAGGGCTCAACCAGCTCACCT GGCTATAC 

CT TGAC CAT AACCATAT CAGCAATAT TGACGAAAATGC T T T TAATGGAATACGCAGAC TCAAAGA 

GCT GAT T CT TAG T TCCAAT AGAAT C TCC TAT T T TC T TAACAATACC T T CAGACC TGT GACAAAT T 

TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 

CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 

AGACT GC C GCAACC T GGAAC T T T T GGACCTGGGATATAAC CGGAT CCGAAGT T TAGC CAGGAATG 

T CT T T GCT GGCAT GATCAGACT CAAAGAAC T TCAC C TGGAGCACAATCAATT T T CCAAGC T CAAC 

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 

CAT AGGACAGACCATGT CCTGGACCT GGAGCTCC T TACAAAGGCT T GAT T T ATCAGGCAAT GAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 

TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 

TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 

TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 

ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 

GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 

TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 

TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 

CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGAC TCCCAGCACCCAGGAATT TTATGTA 

GAT TATAAACCCACCAACACGGAGACCAGCGAGATGCTGC TGAATGGGACGGGACCCTGCACCTA 

TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT. 

GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 

TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 

GGT CATT T T CC TCTCATACATAAT CAAC CCAT T GAAAT T TAAATACCACAAT CAATG T GAAGCT T 

GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 

GTT T T AAGATAAAACT TC T T T CATAGGTAAAAAAAAAAA 
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FIGURE 124. 

MGFNVI RLLSGSAVALVIAPTVLLTMLS SAERGCPKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
LSLRYNS LQKLKYNQFKGLNQLTWL YLDHNH I SN I DENAFNG I RRLKEL I LS SNR I S YFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
l^OTFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKS TTERFDLARAL PKP T FKPKL PRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360/ 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12s 

CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC TT TCTAAAGC TTT GGCAAGGAGT GACAC CCGC CAT T TACAGACAC G TAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
AT GAGCAT TAT CCCC T T TGGAAAT CAGT CAT T GGAGGGAT GAT GGCT GGT GTTAT TGGCCAGT T T 
TT AGCCAAT CCAAC T GACC TAGT GAAGGT TCAGAT GCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
AC C AC T TAT GATAC AGT GAAACAC TACT T GGT AT T GAAT ACAC CAC T T GAGGACAATAT CAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAAT GAATCAAC CACGAGATAAACAAGGAAGGGGAC TTT T GTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
ff T GAG T GGAGT CAGT CCAT T T TAA 

PS ' 
M 

r:: 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALG 1 I EEEGFLKLWQGVT PAI YRHWYS GGRMVT YEHLRE WFGKS E DEHYPLWKS 
VI GGMMAG V I GQ FLANP T DLVKVQMQME GKRKLE GKPLR FRGVHHAFAK I LAE GG I RGLWAGWVP 
NI QRAALVNMGDLT T YDT VKHYLVLNT PLE DNIMTHGL S S LCS GLVAS I LGT PADVI KS RIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T CGCCTTCAAACT CCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT T GGAGACAT CCAGAT GAGGCCCACGGAGCAGGTGACC TAGT GAC T GTGGGAGCTC C 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CAC GGGATACCAC T T CC T G T CCACACC CCGACC AGGGGC T AGAAAAT T T G T T T GAGAT T T T TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAA^ATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCGCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC TGGCCCAGTGGAT T TCAT GGT GATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGI^PDTLLSGSTSGWKDMFIWNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAftATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 



Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 



t\ 199-205 

■■•si 



Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTAC ATGG GCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGGAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTG71AATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT T T TT AACTGATAGT TGT ACAT AT T T GGGGGTACAT GT GAT AT T T GGATACATGTAT ACAA 
TATATAAT GATCAAATCAGGGTAAC T GGGATATCCATCACAT CAAACAT T TAT T T T T TAT TC TTT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTC'TGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGGTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATCCATGTTGCTGCAAAT GACAGGATTTCGTTCTTAATTTCAAT TAAAATAACCACACAT G 
GCAAAAA 



FIGURE iso 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 



M N-myristoylation site. 

amino acids 8 6-92 

m 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

VI amino acids 68-79 



i H 
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FIGURE 131 



m 

Ui 



CI 



TT CTG AAGT AAC GGAAGCT ACCTT GT AT AAAG AC CT CAAC ACTGCT GACCATGAT C AGC GCAGCCT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC CAGAGG AT GCTACAACTCTC T ACCTT CAGAACAACCAAAT AAATAAT GC T GGGAT T CCTTC AGAT 
TT GAAAAACTTGCT G AAAGT AGAAAGAAT ATACCT AT AC C AC AAC AGT TT AG AT GAAT TTCCT ACCAACCT 
CC CAAAGT AT GT AAAAGAGT T AC ATT TGC AAG AAAAT AAC AT AAGG ACT ATC ACTT AT G ATT CACTTT CAA 
AAAT TC CCT ATCTGG AAG AATT AC AT TT AGAT GACAACT CT GTCTCTGC AGT TAGC AT AG AAGAGG GAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CC AGG ACT AT AGAAGAAC TACGCT TGGATG AT AATCGC AT AT CCACT ATT T CAT CACC AT CT CT T C AAG 
GT CTC ACT AGT C TAAAAC GCC T GGTT C T AGAT GG AAACCT GT TG AACAAT C AT GGT T TAGGT GAC AAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CT TAT CT AAGGC AGCT CT AT CGACTGGAT ATGT CC AAT AATAAC CT AAGT AAT T T ACCTC AGGGT AT C T T T 
GATG AT TT GGAC AAT ATAAC AC AACT GAT T CT TCGC AACAAT CC CT GGT ATT GCGGGT GC AAGATGAAAT G 
GGTACG TGAC TGGT T AC AAT C ACT ACCT GT GAAGGT CAACGT GCGTGGGC TCAT GT GCC AAGCCCC AG AAA 
AGGT TC GT GGGATGGC T ATT AAGG AT CTCAAT GCAGAACTGT TT GATT GT AAGG AC AGT G GGAT T GT AAGC 
AC CAT T C AG ATAACC ACT GC AAT ACC CAAC AC AGTGTATCCT GC CCAAGGAC AG TGGCC AGCTC C AGT GAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
CAAT T ACAAT TACT GT GAAGTCTGTC ACCT CT G ATACC ATTC AT ATCT CT TGG AAACTTGCT CT ACCT AT G 
AC TGCTT T GAGACT CAGCT GGCTT AAACT GGGCC AT AGCC CGGC AT TTGG ATCT AT AACAGAAACAAT TGT 
AACAGGGG AACGCAGT GAGT AC TT GGT CAC AGCCCT GGAGCCTGATTCACCCT AT AAAGT AT GC AT GGTT C 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CG AAT GT AC AAC CCT ACAACC ACCCT CAAT CG AG AGCAAG AGAAAGAACC TT AC AAAAACCCC AAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T T CAT AGG AATGGATC GCTCTT C T CAAGGAACT GT GCAT ATAGC AAAGGG AGGAGAAGAAAGGAT GAC TAT 
GC AG AAGCT GGC AC T AAG AAGG AC AACT CT AT CCTGGAAATCAGGG AAAC TT CT TT T C AG AT GTT ACC AAT 
AAGC AATGAACC CAT C TCGAAGGAGGAGT T TGT AAT AC AC ACCAT AT T TC CTCC TAATGGAATGAATC TGT 
AC AAAAACAATC AC AGTG AAAGC AGT AGT AACCGAAGCT AC AGAGAC AGT GGT ATT CC AGACT C AG AT CAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VSAVS IEEGAFRDSNYLRLLFLSRNHLST I PWGLPRT IEELRLDDNRI S T I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNTOGmCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGS PS RKT I T I T VKSVTSDT I HI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Ami da ti on site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
f,.^ TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
Q C TCCACTACC TCAAGCT GAGT GAT C CCAAGTACCTAAGAGAGT T CCAGCT GAC CC TCCAGCCCGG 

P GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
fl CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
rj GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
hj AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
Q GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
^ AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 

U'l 

»S AACAGCAGGAAGGATGC T T CGGGGAGCC T GAT GCTGAAGATGAAGAAT TAT CTAAAGC TAT TCAA 

W 

jjjj TAT CAGCAGCAT T T T TCGAGGAGAGT GAAGAGGCGAGAAAAACAATT TCCAGAT TCT CGCTC TGT 

Cj TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
HI AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 

T TTTATATTT TTT TAGTAGAGACAGGGT TTCATCATGTTGC TCAT GCTGGTCTCGAAC TCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCACT T T CAAAGAGACT GTGT T GAATAAAGGGCCAAGGT T CT TGCCACCCAGCACT C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GT GGCT T C CTATACATCC TGGCAGAATACCCC C CAGCAAACAGAGAGCCACACCCATC CACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T TAGT CCTCATCCC T TAGAT C C TGGAGGGCAC GGAT CACATCC TGGGAAGAAGGCAT C TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
G GC C G G GAG G GAC C C AG GTGT G AAC G GAT G AAT AAAG T T C AAC T G C AAC T G AAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWS'SSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLRE FQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y I NL FCANMMDLNRRAEAI GYAYPTRDI FMENIMFCGMGGFS DFYKLRWLEAI L S WQKQQEGC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites - 

amino acids 149-156, 274-282 

N-myristoylation sites . 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T TACAGAT T TGATC CCGT T CGAGT GGAT ATCACT T C GAAAGGAAAAAT GAGAGCAAGAT AT G 
TGAAT T ACAT CAAAACATCAGAGG T TGT CAGAC TGCCCTATCC T CT CCAAATGAAAT C T TCAGGT 
C CAC C T T CT T AC T T TAT TAAAAGG GAAT C G T GGGGC T G GAC AG AC T T T C T AAT GAAC CC AATGG T 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCT GGCAAAAGGAGG TAG TCAGGCCGT CCAGAGC TGGCATT TGCAC 
AAACACGGCAACAC TGGGTGGCAT C CAAG TC T T GGAAAACCGT GT GAAGCAAC TAC TATAAACT T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAG G T CAAT AT T GAT GT CAC T GAAT T AAT T AC AGT G T CC T AT AGAAAAT GC C AT T AATAAAT TAT 
AT GAACT ACT ATACAT T AT GT AT AT TAATTAAAACATCT TAAT CCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFIJyiNPMVMMMVLPLL 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site, 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP -binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGGACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGAGCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
Ui AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
'f?'l TGCAAGCCAAAGATGGAAACAT T GACATCAGAATCT TAAGGAGGACT GAGTCTT TGCAAGACACA 

Q AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
I . | AAGAAATACAGCCAGAT TCTGAGTCACT TTGAAAAGCTGGAACCTCAGGCAGCAGT TGTGAAGGC 

Q T TT GGGGGAAC TAGACATTCT T C TGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCT G 

n" CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
5] ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
X! CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 

TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
□ ATGTATT TATT TTTT TACTTGGACAT GAAACT T TAAAAAAATTCACAGAT TATAT TTATAACCTG 

f|j ACT AGAGCAGGTGAT GTAT T T T T ATACAG T AftAAAPAAAAAACC TTGT AAAT TC TAGAAGAG TGG 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT T T GAAAT T GAAC C AAT GAC T AC T TAG GAT GGG T T G T GGAAT AAG TTTT GAT G T GGAAT T GC AC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTAT TTCTACAAATAAAGT TTTCTTT GCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SIANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51/ 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC TGGAAAAAGAATAGTCTT TTATGGAGAT GAAACCTGGGTTAAAT TA 
T T CCCAAAGCATT T TGTGGAATAT GAT GGAACAACC TCAT TTT T CG TGT CAGAT TACACAGAGGT 
GGAT AATAAT G TCAC GAGGCAT T T GGATAAAGT AT TAAAAAGAGGAGATT GGGACATAT TAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
yl CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
i a | AG T T GGC T T AC CGAT TCCAAAAGACAGT GT AGGGAGCCTCCTAT TCCCAGT TG T GGAAGGAAGAC 

•H CAAT GAGAGAGCAGTTGAGATT TTTACATTTGAATACAGTGCAGCTTAGTAAAC TGTTGCAAGAG 

AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 

Ul GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 

PI 



a 



a 

m 



HI 



TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
Q AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGT GATT G TAAAGAGG TGGCGGT CACAGAGGAACAAGCCC CCCAGCTGAGGGGG T GT GT GAA 
*£* T CGGACAGCCTCCCAGCAGAGGTGTGGGAGC T GCAGC TGAGGGAAGAAGAGACAATC GGCC T GGA 

Bl CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGAT CCACAGAGCCCAGGAT CAAGGGACC CACT GCAGTGGCAGCAG GAC TGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACC GAGACAC T CACAGC T T TGTCAT CAGGGCACAGGC T TCC T C GGAGCCAGGATGATC TGT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAG TATGTAGT TACCAAAAGAAT AAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQT^KAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWD I L I LH YLGLDHI GH I SGPNS PL I GQKL S EMDS VLMKI HT S LQS KERET PLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

1 Transmembrane domain: 

I ■ 

* amino acids 58-76 

I 

N-glycosylation sites. 

amino acids 56-60/ 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

I 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 



APP 



ID= 
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FIGURE 141 



m 



ill 



■*4 ? 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCT TAAGACCACTCACACCTT CAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT T CCAGATAAAAAC TACATACGC CCAGAGAT C TT C T T T GCAT TAGCCT CATCC TTGAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAGT CAT CCAT CCCT T CAGC TGAAGAAGGAGAAAC T GATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTT GGGGT GACAGATAAATTT GAGAAC AGGAAACACAT T GAAT T TT CATT T CAACCAGTT TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAAC TAAT T GTATAAAAACAC CAAACCT GC T CAC T 
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FIGURE 142 



MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVG S WNMLE SAAH PGW F I CTS CNCNE PVGVT DKFENRKH I E FS FQPVCKAEMS P SE VSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-rnyristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE Ida 

CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAGTTTGC TGTCCACACAT TCAACCAACAGAGCAAGGACTAC TATGCC TACAGACTGGGG 
CACAT CT TGAATT CC TGGAAGGAGCAGGT GGAGTCCAAGAC TGTATT CT CAATGGAGCTAC TGCT 
GGGGAGAACTAGGTGTGGGAAAT TT GAAGACGACAT T GACAAC TGCCAT T T CCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCAC TGA GTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
T TGGAC TTGTTTGT TAT CCT AT T T T GCAT G T GT T TGAGAT CT CAGAT CAGT GT T T TAGAAAAT CC 
ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCATTTTAAGAAAA^AAAJ^AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHE 

YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAAT CAAG T GGAACC GGAAGGCCCTGCCCAGCAC TGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAAC TACT GGCAGTT CCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
C CCAGG C G GC GAAC CAG G GGG AG T T C C AGAAG C CAGAC AAC AAG C T C CAC C AGCAGG T GC T CT G G 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACAC AT CAGGCACT GCGCCAC C T GC T T CAC AG TACT T CCC AACAAC T C T TAGAGGTAG 
GT GTATTCCCGTT TTACAGATAAGGAAACT GAGGCCCAGAGAGCT GAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C T C T AAT G AAAT T G T GAAAGC T C CAT G T T T AG AAAT AAAT G AAAAC AC C T G A 
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FIGURE 146 



MRKHLSWWWLAT VCMLL FS HLSAVQTRG IKHR IKWNRKALPS TAQI TEAQVAENRPGAF IKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEAOTTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide : 

amino acids 1-26 

Transmembrane domain: 



be* 

Q amino acids 157-171 



N-glycosylation sites. 

amino acids 98-102, 110-114 



amino acids 76-83 



Tyrosine kinase phosphorylation site. 



N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
C CT GAT GAT T T ATAGAC T CAAAGAAAACT CATGT TCAGAAGC T CTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTT CAGATTCAGGGGGTACATGTGAAGGT TTGTT TTATGAGTATAT TGCATGATGCTGAGG 
TTTGGGGT 



m 



m 
ca- 
rts 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of* the protein: 
Signal peptide: 

amino acids 1-25 



N-myristoylation site. 

amino acids 62-68 



□ 



H 
ill 



ill 
m 

rH 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACAT AATAGAGAATTAAGTGCAGAAAGACCTT T GAATGAACAGAT TGCT GAA 
GCAGAAGAAGACAAGAT TAAAAAAACATATCCTCCAGAAAACAAGCCAGGT CAGAGCAACTAT T CT T T 
T GT TGATAACTTG7VACCTGCTAAAGGCAATAACAGAAAAGGAA?^AAATT GAGAAAGAAAGACAAT CTA 
U TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 

Q ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 

fH 

*** T CAACTAGACGGGACTCCTT TAACCGCTGAAGACAT T GT C CATAAAATCGCTGCCAGGATT TATGAAG 

9 

% AAAATGACAGAGCCGTGTTT GACAAGATT GTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 

T\ CAAGCACATACAC TGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATCTCAAAGGAAGCCAACAA 

III T TATGAGGAGGAT CCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGCTGGAAAAATACCAGAGAAAG 

O T GACTCCAATGGCAGCAATT CAAGATGGT CTT GCTAAGGGAGAAAACGAT GAAACAGTATCTAACACA 

3 TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA 

rl 

Ssf< ATATTTCC CAAAT TTCTATGCGCT ACT GAAAAGTAT TGATT CAGAAAAAGAAGCAAAAGAGAAAGAAA 

!f'l 

CAC T GAT T ACTAT CAT GAAAACAC T GATT GAC T T T GT GAAGAT GAT G GT GAAAT AT G GAAC AATAT C T 
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
f*\ AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
f|J C AGAC AGT AC C AAG GAAGAAGCAG C TAAGAT G GAAAAGGAATATGGAAGCT T G AAG GAT T C CAC AAAA 

GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTT T 
CAAAGATGAGAGACTT CATCAATAAACAAGCT GATGCTT ATGT GGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GT TATTAGAAAGT GCTGAATTTACAGT AGTTAACCTT TT ACAAGT GGT TAAA&CATAGCT TTCTT CCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTL I DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-rnyristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AAT AACCAGCTT CT AGCTGGAGGGCT GCAT GCAGGGAAGGT CATTAAAG GTGAAGAGATCAG C 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T GT CAGACT CACCCAGC T TCC CGAGAAT GG TGGC TGGAAT GCC CCCATCACAGAC TTC TAC T TCC 
AGCAGTGTGAC TAG GGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGT CAGTAGC TC TCCACATGAAGT CC T G T CAC TCACCAC TGT GCAGGAGAGGGAGG T GGTCAT A 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
fi GAAATGGC TC GAGCT CAGAAGAT AAAAGATAAGTAGGGTATGC T GATCCT CT T T TAAAAACCCAA 

GATACAAT CAAAAT CCCAGAT GCTGGT CT CTAT TC CC AT GAAAAAGTGCT CAT GACATAT T GAGA 
UJ AGACCTAC T TACAAAGT GGCATATATT GCAAT T TAT T T TAAT TAAAAGATACCTATT TATATAT T 

Q TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
^ AT T C C T T G T AT AAAAAT AAG AAAAG AAAT TAAT C T T GAG G TAAG CAG AG C AGAC AT CAT C T C T G A 

Q TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
LF1' TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
Q GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
%l GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
% ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
W.I GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
HI GAC T GGT T T CC T TGTAT GAAAAGGAGAGGACACAGAGACAGAGGAGAC GCGGGGAAGACTATGTA 

AAGATGAAGGCAGAGAT CGGAGT TT TGCAG CCACAAGC TAAGAAACACCAAGGAT TGTGGCAACC 
ATCAGAAGCTT GGAAGAGGCAAAGAAGAAT T C T TCC CTAGAGGC TT TAGAGGGATAACGGC TC T G 
C T GAAACCT TAATC T CAGAC T T CCAGCC T C CTGAAC GAAGAAAGAATAAAT T TCGGC TGT T T TAA 
GCCACCAAGGATAAT TGGT TACAGCAGC T C T AGGAAAC TAATACAGC T GCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGT T GCT CAGT T T T GGT CTAACT T GT TATGCAGCAATAGAT AAAT AAT ATGCAGAGAAAGAG 
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FIGURE 1S2 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGTGGAGAGATCAAAGCAATT GGAGAAC T GGAT TT GCT GT T TATGTC TC T GAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC T T TC T AAGCAT AGATAT TTAT TGAT AACAT T TCATT GTAAC TGGTGT T C 
T AT AC AC AGAAAACAAT T TAT T T T T T AAAT AAT TGTCTTTTTC CAT AAAAAAGAT T AC T T T CC AT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
T GT AT TTAT TAT TAT TAT AAGACT GCAT T TTAT T TATAT CAT T T TATTAATATGGAT T TATT TAT 
AGAAACATCAT TCGATAT TGCTACTTGAGTGTAAGGCTAATATT GATATT TATGACAATAATTAT 
AGAGCTATAACATGT T TAT T TGACC T CAATAAAC AC T TGGATAT CC C 
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FIGURE 154 

MAALQKSVSSFLMGTLATSC^ 

ADNNTDVRL I GEKL FHGVSMSERCYLMKQVLNFTLEEVLFPQS DRFQPYMQEWP FLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCGAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGGACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
\Z CCC CC TGGAGATATGAGT TGGACAGAGAC T T GAACC GGC T CCCCCAGGACC T GTACCAC GCCCGT 

% TGCGTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
r|) GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
UJ AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 

If? 

ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

{3 

f ? | CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
Lf! GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
0 GCCCAGCACAGGCAC TTTC TAGATAT TTCCCCCT TGC TGGAGAAGAAAGAGCCCCTGGTTTTAT T 

TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 

CI 

m TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 

! si 

CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCYSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites . 

amino acids 44-50, 150-156 
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FIGURE 1*7 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GAC CGT T CAATGTGGCT C TGAAACT GGGCCAT C TCCAGAGTGGAT GC TACAACATGAT CTAATC C 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATT TGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGC TGT GTGAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAG C T GAACAC AGT C TAT T T CAT T G GGG C CC AT AAT AT T C C T AAT G CAAAT AT GAAT GAAGAT G G 
C C C T T C CAT G T C T G T GAAT T T C AC C T C AC C AGG C T GC C T AGAC C AC AT AAT G AAAT AT AAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 

Q 

GTAGAAGTGAAC TTCACAACCACT CCCCTGGGAAACAGATACAT GGCTCT TATCCAACACAGCAC 

w 

fjp; TAT CAT CGGGTTTTCT C AG GT G T T T GAGC C AC AC C AGAAGAAACAAAC GC G AGC T T C AG T GG T GA 

yj TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
r '4 AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
; !!! GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

CI 

CAT GGGTGC T GGTGGCAGGGAT CTAT CTAATG T GGAGGCACGAAAGGATCAAGAAGACT T C CT T T 
Q TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
IfJ T CACACAAT T T GT TAC TTCACTGAATT TC T T CAAAACCAT TGCAGAAGTGAGGT CAT CC TT GAAA 

£j AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
2f GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
q.| GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GT GAT CT AAGAAGCCAGAT TCAT CT GCACAAATACG TGGT GGTC TAC T TT AGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACT TC TCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGAT CACAAGCC TGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLJUULCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKECAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites . 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 



5 ' 
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FIGURE 1*Q 



AGC C AC CAG C GC AAC ATGA C AG T GAAGAC C C T GCAT G GC C CAGC CAT GG T C AAG T AC TTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGT^AAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
M AC TCAGCT GAAGAAG 

a 
a 

m 



5 - 



Q 

m 
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FIGURE 160 



MTVKTLHGPAKVKYLLLSILGLAFLSEAAARKIPI<VGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TCCGTGC T GGAAGTAGGAGGAGAG TCAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
f*f GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
% ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
y GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAA.CC TGACTGGACCGCAGATCATTACCTTGAA 

■0 CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
L|| CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
%l CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
: a GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 

TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
G?l AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
g- CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
r* GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
rfj CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
*£j CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
U CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
.81 CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
j»| CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
X;J CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
■ *■ CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 

GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GGGAC T TAAAT AAAGGC AGAC GCTGTTTTTC TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPS.ECASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY IHKRWALVWLACLL FAAALSLI LLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWS RRELSAQGPVAW FHAQRRQTLQEGGWVLL FS P 
M GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
CI LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

I I 

'f\ Signal sequence: 

amino acids 1-20 

'""it 

?H Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites . 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site, 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 



APP ID=10063730 



Page 316 of 322 



FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATCAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAAC AT C C T G AC G T GGG AC AG C G G G C C AG AG G G C AC C C C AG AC AC G G T C T AC AG CAT C GAG TATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AAC CTGACGG T GGAGACGGGCAACC T CACGGAGC TC TAC TATGCCAGGGT CACCGC T 
GT CAGTGCGGGAGGC CGG TCAGCCACCAAGAT GAC TGACAGGT TCAGC T CT CT GCAGCACAC TAC 
C CT CAAGC CACC TGATG T GACCTGTATC TCCAA&G T GAGAT CGAT T CAGAT GATT GT T CATC C T A 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TAC CACT TAGAGC TCCAGGTCAACCGCACC T ACCAAATGCACC T T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
u TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
^ CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
Q TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
Q AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
f|| TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
x'\ CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
fi GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
% \ GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
III CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
Q CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
1, CCC T GGGGAT T TGCACAGACAGAACATCT GAC CCAAATGTGCT ACACAGT GGGGAGGAAGGGACA 

M CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
Ul CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
f I TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
%l GAGCAGCCCACAGAAC TGGAT T C T C T TT TCAGAGGC C TGGC CC TGAC T GTGCAGT GGGAGT C C TG 

2 AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
M ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
f|| AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T TCC CAGC CAGGGCAACT GCC T GACGT T GCACGAT T TCAGC TT CAT TCCT C T GATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCC TT TTCT GCAGGCAGGAGTTTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAA.TTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
T C T T TAAAGTGC T TAAT AGTGCCTGG T ACAT GGGCAGTGC C CAATAAACGG TAGC TAT T T AAAAA 
AAAAAAAA 
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FIGURE i6a 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRI T RKS CNL T VE T GNL TE L Y YARVT AVS AGGRS AT KMTDR FS S L QHT TLKP PDVTC I S 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLH'QPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

■CO ' 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



ill 
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FIGURE i6s 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAAC ATG G 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTAGAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTTTTGGATATTTTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA 
AGAAGAGCTACATATTC CAGCAGAT GAGACAGAC T T TGTC TGCT T T GAAGGAGGAAGAGATGAT T 
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG 
T CGAAGAAAGC TGAAGAAGT T TCT CAGCACAGAGAGAAAT CT CC TGAGGAGT C T CGGGGGCGT GA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGC TGCAGGGACAGCCC TCAGC TCAGGAGAGCCACCCT CACACCAGCGG T 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTGGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGT TATT 
CAT T AC AGCAAAGGAT T TC GT T G GC AT CAAAATC T AAGT T T GT T T TACAAAGAT T GT T T T TAGT A 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 



y 



Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site, 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTGAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTT GAGAATGAAGCAGAACAGAAGT TAATAGAGAGCATGT T GCAAAACC T GACAAAACCC GGG 
ACAGGGAT TTCTGATGGTGATTT CTGGATAGGGC TT T GGAGGAATGGAGATGGGCAAACATC TGG 
U TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
PI ATGAACCT TCC TGCGGAAG TGAAAAG T GT GT TGT GAT GTAT CACCAAC CAAC TGCCAATCC T GGC 

% CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGAGAGGTGTAACATGAAGCACAATTATATTTG 
W C AAG TAT GAACCAGAGAT TAAT CCAACAGC CC CT GTAGAAAAGCCT TATC T T ACAAAT CAACCAG 

31 GAGACACCCATCAGAATGTGGT TGT TACT GAAGCAGGTATAATTCCCAAT CTAATT TATGT TGTT 

hj AT AC CAACAATACCC CTGC TC T T AC T GAT ACT GGT TGC T T T T GGAACC TG T T G T T TCC AGAT GC T 

GCAT AAAAG TAAAGGAAGAACAAAAACTAGTCCAAACCAGT CTACAC T GT GGAT T TCAAAGAG T A 
.** CCAGAAAAGAAAG TGGCAT GGAAGT ATAAT AAC TCAT TGAC TT GGT TCCAGAAT T T TGTAAT TCT 

l M GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
Q AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
a T T T AAAGAAT AT GCT G T G C T AAT AAT G GAG T GAGAC AT GC T TAT T T T G C T AAAGGAT GC ACC C AA 

ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
% TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
^1 T G TAT AT T G TAT T G AAAT T T AC AG T G T G C AAAAG TAT T T T AC C T T T G CAT AAG T G T T T GAT AAAA 

G ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
01 AAACT TAT TAC T GTTGTCAACT GAAT T CACACACACACAAATAT AGTACCATAGAAAAAGT T TGT 

m TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
% AATAAGAAGCTAT TT CAT TAAGT G T GAT ATAAACCTCC TCAAACAT T T TACT TAGAGGC AAG GAT 

S\* TGTCTAAT T TCAAT T GTGCAAGACAT GT GC C T TATAAT TAT T TT TAGCT TAAAAT TAAACAGAT T 

T T G TAAT AAT G T AAC T T T G T TAAT AG G T G C AT AAAC AC TAAT G C AG T C AAT T T G AAC AAAAG AAG 
TGACATACACAATATAAATCATATGTCTTCACACGT TGCC TATATAAT GAGAAGCAGCTCTC TGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATAT TAGTATACAAAGAGGT CAT GT GG T T GAGAC CAGGTGAATAGTCAC TAT CAGT G TGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGA7^AGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAG 1 1 PNLI YWI PT I PLLLL I LVAFGTCC FQMLHKSKGRTKTS PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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